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(57) Abstract 



A biosynthctic method for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA) is disclosed. Such a method includes 
fermentation of a genetically modified microorganism or plant to produce l^ascorbic acid. In particular, the present invention relates 
to the use of microorganisms and plants having at least one genetic modification to increase the action of an enzyme involved in 
the ascorbic acid biosynthctic pathway. Included is the use of nucleotide sequences encoding epimerases, including the endogenous 
GDP-D-mannose:GDP-L-galactosc epimerasc from the L-ascorbic acid pathway and homologues thereof for the purposes of improving 
the biosynthctic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of 
microalgae, bacteria and yeast useful for producing l-ascorbic acid, and to genetically modified plants, useful for producing consumable 
plant food products. 
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VITANON C PRODUCTION IN MICROORGANISMS AND PLANTS 

FIELD OF THE INVENTION 
The present invention relates to vitamin C (L-ascorbic acid) production using 
genetically modified microorganisms and plants. In particular, the present invention 
5 relates to the use of nucleotide sugar epimerase enzymes for the biological production of 
ascorbic acid in plants and microorganisms. 

BACKGROUND OF THE INVENTION 
Nearly all forms of life, both plant and animal, either synthesize ascorbic acid 
(vitamin C) or require it as a nutrient. Ascorbic acid was first identified to be usefiil as a 

10 dietary supplement for humans and animals for the prevention of scurvy. Ascorbic acid, 
however, also afifects human physiological fiinctions such as the adsorption of iron, cold 
tolerance, the maintenance of the adrenal cortex, wound healing, the synthesis of 
polysaccharides and collagen, the formation of cartilage, dentine, bone and teeth, the 
nudntenance of capillaries, and is useful as an antioxidant. 

15 For use as a dietary supplement, ascorbic acid can be isolated firom natural 

sources, such as rosehips, synthesized chemically through the oxidation of L-sorbose, or 
produced by the oxidative fermentation of calcium D-gluconate by Aceiobacter 
suhoxidans. Considine, "Ascorbic Acid," Vcm Nostrcmd's Scientific Encyclopedia, Vol, 
1, pp. 237-238, (1989). Ascorbic acid (predominantly intracellular) has also been 

20 obtained through the fermentation of strains of the microalga, Chlorella pyrenoidosa. See 
U.S. Patent No. 5,001,059 by Skatrud, which is assigned to the assignee of the present 
application. It is believed that ascorbic acid is produced inside the chloroplasts of 
photosynthetic microorganisms and functions to neutralize energetic electrons produced 
during photosynthesis. Accordingly, ascorbic acid production is known in photosynthetic 

25 organisms as a protective mechanism. 

Therefore, products and processes which improve the ability to biosynthetically 
produce ascorbic acid are desirable and beneficial for the improvement of human health. 



30 



SUMMARY OF THE INVENTION 
One embodiment of the present invention relates to a method for producing 
ascorbic acid or esters thereof in a microorganism. The method includes the steps of (a) 
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culturing a microorganism having a genetic modification to increase the action of an 
enzyme selected from the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-niannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
5 galactose- 1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone 
dehydrogenase; and (b) recovering the ascorbic acid or esters produced by the 
microorganism. Preferably, the genetic modification is a genetic modification to increase 
the action of an enzyme selected fi-om the group of GDP-D-mannose: GDP-L-galactose 
epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P-phosphatase, L-galactose 

10 dehydrogenase, and/or L-galactono-Y-lactone dehydrogenase. In one embodiment of the 
method of the present invention, the microorganism further includes a genetic 
modification to decrease the action of an enzjrme having GDP-D-mannose as a substrate, 
other than GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification 
can include, for example, a genetic modification to decrease the action of GDP-D- 

1 5 maimose-dehydrogenase. 

In one embodiment, the genetic modification is a genetic modification to increase 
the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, which can include GDP-D-maimose:GDP-L-galactose epimerase. In one 
embodiment, the epimerase binds NADPH. In one embodiment of this method, the 

20 genetic modification includes transformation of the microorganism with a recombinant 
nucleic acid molecule that expresses the epimerase. Such an epimerase can have a tertiary 
structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D- 
marmose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code Ibws. Preferably, the epimerase has a structure 

25 having an average root mean square deviation of less than about 2.5 A, and more 
preferably less than about 1 A, over at least about 25% of Ca positions of the tertiary 
structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. 

In one embodiment, the epimerase comprises a substrate binding site having a 

30 tertiary structure that substantially conforms to the tertiary structure of the substrate 
binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by 
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atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. Such 
a substrate binding site preferably has a tertiary structure with an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Ca positions of the 
tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose 
5 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code Ibws. 

In another entfxxliment, the epimerase comprises a catalytic site having a tertiary 
structure that substantially conforms to the tertiary structure of the catalytic site of a 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates 

10 having Brookhaven Protein Data Bank Accession Code Ibws. Such a catalytic site 
preferably has a tertiary stnicture with an average root mean square deviation of less than 
about 1 A over at least about 25% of Ca positions of the tertiary structure of a catalytic 
site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
coordinates having Brookhaven Protein Data Bank Accession Code Ibws. The catalytic 

15 site preferably includes the amino add residues serine, tyrosine and lysine and in one 
embodiment, the tCTtiary structure positions of the amino add residues serine, tyrosine and 
lysine substantially conform to tertiary structure positions of readues Serl07, Tyrl36 and 
Lysl40, respectively, as represented by atomic coordinates in Brookhaven Protein Data 
Bank Accession Code Ibws. 

20 In yet another embodiment of this method, the epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 
amino acid residues in the amino add sequence align with 100% identity with at least 
about 50%, and in another embodiment with at least about 75%, and in yet another 
embodiment with at least about 90% of non-Xaa residues in SEQ ID NO: 11. In another 

25 embodiment, the epimerase comprises an amino acid sequence having at least 4 
contiguous amino add residues that are 100% identical to at least 4 contiguous amino acid 
residues of an amino acid sequence selected from the group of SEQ ID NO: 2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO: 10. In yet another embodiment, 
the recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at 

30 least about 12 contiguous nucleotides having 100% identity with at least about 12 
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contiguous nucleotides of a nucleic acid sequence selected from the group of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9. 

In yet another embodiment of this method of the present invention, the epimerase 
comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly. In 
5 yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid 
sequence that is at least about 15% identical, and in another embodiment, at least about 
20% identical, and in another embodiment, at least about 25% identical, to a nucleic acid 
sequence selected from the group of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO;7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Upman- 

10 Pearson standard default parameters. 

In yet another embodiment of this method of the present invention, the 
recombinant nucleic add molecule comprises a nucleic add sequence that hybridizes under 
stringent hybridization conditions to a nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-I>-niannose epimerase/reductase. The nucleic acid sequence encoding the GDP-4- 

15 k:eto-6-deoxy-D-mannose epimerase/reductase includes nucleic acid sequences selected 
from tiie group of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, and the GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase can include an amino acid sequence selected 
from the group of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6. 

In one embodiment of the method of the present invention, the microorganism is 

20 selected from the group of bacteria, fungi and microaJgae. In one embodiment, the 
microorganism is acid-tolerant. Preferred bacteria include, but are not limited to 
Azotdbacter and Pseudomonas. Preferred fungi include, but are not limited to, yeast, 
including, but not limited to Saccharomyces yeast. Preferred microalgae include, but are 
not limited to, microalgae of the gcnexaProioiheca and CMorella, with microalgae of the 

25 genus Prototheca being particularly preferred. 

In yet another embodiment of the method of the present invention, the 
microorganism is add-tolerant and the step of culturing is conducted at a pH of less than 
about 6.0, and more preferably, at a pH of less than about 5.5, and even more preferably, 
at a pH of less than about 5,0. The step of culturing can be conducted in a fermentation 

30 medium that comprises a carbon source other than D-mannose in one embodiment, and 
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in another embodimmt, the step of culturing is conducted in a fermentation medium that 
comprises glucose as a carbon source. 

In yet another embodiment of the present method, the step of culturing is 
conducted in a fermentation medium that is magnesium (Mg) limited. Preferably, the step 
5 of culturing is conducted in a fermentation medium that is Mg limited during a cell growth 
phase. In one embodiment, the fermentation medium includes less than about 0.5 g/L of 
Mg during a cell growth phase, and more preferably, less than about 0.2 g/L of Mg during 
a ceil growth phase, and even more preferably, less than about 0. 1 g/L of Mg during a cell 
growth phase. 

10 Another embodiment of the present invention relates to a microorganism for 

produdng ascorbic add or esters thereof. The microorganism has a genetic modification 
to increase the action of an enzyme selected fi^om the group of hexokinase, glucose 
phosphate isomerase, phosphomarmose isomerase, phosphomannomutase, GDP-D- 
mannose pyrophosphoryiase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L- 

15 galactose phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, 
and/or I^galactono-y -lactone dehydrogenase. Preferably, the genetic modification is a 
genetic modification to increase the action of an enzyme selected fi'om the group of GDP- 
D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l- 
P-phosphatase, L-galactose dehydrogenase, and/or I^galactono-y -lactone dehydrogenase, 

20 and even more preferably, to increase the action of GDP-D-mannose: GDP-L-galactose 
epimerase. 

In one embodiment, the microorganism has been genetically modified to express 
a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure 

25 having an average root mean square deviation of less than about 2.5 A over at least about 
25% of Ccc positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code Ibws. In another embodiment, the microorganism has been 
genetically modified to express a recombinant nucleic acid molecule encoding an 

30 epimerase that catalyzes conversion of GDP-D-mannose to GDP-l^galactose, wherein the 
epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 1 1 using a 
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CLUSTAL alignment program, wherein amino acid residues in the amino add sequence 
align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 1 1 . 
Preferred microorganisms are disclosed as for the method discussed above. 

Yet another embodiment of the present invention relates to a plant for producing 
5 ascorbic acid or esters thereof. Such a plant has a genetic modification to increase the 
action of an enzyme selected fix)m the group of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, 
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose-l-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone 

10 dehydrogenase. In a preferred embodiment, the genetic modification is a genetic 
modification to increase the action of an enzyme selected fi'om the group of GDP-D- 
mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y-lactone dehydrogenase, 
and in a more preferred embodiment, the genetic modification is a genetic modification 

15 to increase the action of GDP-D-mannose: GDP-L-galactose epimerase. 

In one embodiment, the plant fiirther comprises a genetic modification to decrease 
the action of an enzyme having GDP-D-maimose as a substrate other than GDP-D- 
mannose:GDP-L-galactose epimerase. Such a genetic modification includes a genetic 
modification to decrease the action of GDP-D-mannose-ddiydrogenase. Such a plant also 

20 includes a plant that has been genetically modified to express a recombinant nucleic acid 
molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein the epimerase has a tertiary structure having an average root mean 
square deviation of less than about 2.5 A over at least about 25% of Ca positions of the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 

25 by atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. In 
another embodiment, such a plant has been genetically modified to express a recombinant 
nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-I>- 
mannose to GDP-I^galactose, wherein the epimerase comprises an amino acid sequence 
that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein amino 

30 add residues in the amino acid sequence align with 100% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO: 1 1 . 
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In one embodiment, a plant for producing ascotbic acid or esters thereof according 
to the present invention is a microalga. Preferred microalgae include, but are not limited 
to microalgae of the genera Prototheca and Chlorella^ with microalga of the genus 
Proioiheca being particularly preferred. In another embodiment, the plant is a higher 
5 plant, with consumable higher plants being more preferred. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 A is a schematic drawing of the pathway from glucose to GDP-D-mannose 
in plants. 

Fig. IB is a schematic drawing of the pathway from GDP-D-mannose to L- 
1 0 galactose- 1 -phosphate in plants. 

Fig. IC is a schematic drawing of the pathway from L^galactose to L-ascorbic acid 
in plants. 

Fig. 2A is a schematic drawing of selected carbon flow from glucose in 
Prototheca. 

15 Fig. 2B is a schematic drawing of selected carbon flow from glucose in 

Prototheca. 

Fig. 3 is a schematic drawing that shows the lineage of mutants derived from 
Prototheca moriformis ATCC 75669, and their ability to produce L-ascorbic acid. 

Fig. 4 is a bar graph illustrating the conversion of substrates by resting cells of 
20 strain NA45-3 following growth in media containing various magnesium concentrations 
and resuspension in media containing various magnesium concentrations. 

Fig. 5 is a line graph showing the relationship between specific ascorbic acid 
formation in cultures of Prototheca strains and the specific activity of GDP-D- 
mannose:GDP-L-galactose epimerase in extracts prepared from cells harvested from the 
25 same cultures. 

Fig. 6 is a line graph showing the relationship between specific epimerase activity 
and the degree of magnesium limitation in two strains, ATCC 75669 and EMS 13-4. 

Fig. 7 depicts the overall catalytic mechanism of GDP-D-mannose: GDP-L- 
galactose epimerase proposed by Barber (1979, Jl Biol. Chem. 254:7600-7603). 
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Fig. 8A depicts the catalytic mechanism of GDP-D-mannose-4,6-dehydratase 
(converts GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose). 

Fig, 8B depicts the catalytic mechanism of GDP-4-k:eto-^deoxy-D-mannose 
epimerase/reductase (converts GDP-4-keto-6-deoxy-D-maimose to GDP-L-fucose) 
5 (Chang, et a]., 1988, J. Biol. Chem. 263:1693-1697; Bari>er, 1980, Plant Physiol 66:326- 
329). 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to a biosynthetic method and production 
microorganisms and plants for producing vitamin C (ascorbic acid, L-ascorbic acid, or 
0 AA). Such a method includes fermentation of a genetically modified microorganism to 
produce L-ascorbic acid. In particular, the present invention relates to the use of 
nucleotide sequences encoding epimerases, including the endogenous GDP-D- 
mannose: GDP-L-galactose epimerase firom the L-ascorbic acid pathway, as wrfl as 
epimerases having structural homology (e.g,, by nucleotide/amino acid sequence and/or 
tertiary structure of the encoded protein) to GDP-4-k:eto-6-deoxy-D-mannose epimerase/ 
reductases, or UDP-galactose 4-epimerases, for the purposes of improving the 
biosynth^ic production of ascorbic add. The present invention also relates to genetically 
modified microorganisms, such as strains of microalgae, bacteria and yeast useful for 
producing L-ascorbic acid, and to genetically modified plants, useful for producing 
consumable plant food products. 

One embodiment of the present invention relates to a method to produce L- 
ascorbic acid by fermentation of a genetically modified microorganism. This method 
includes the steps of (a) culturing in a fermentation medium a microorganism having a 
genetic modification to increase the action of an enzyme selected fi'om the group of 
hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-mannose: GDP-L- 
galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P-phosphatase, L- 
galactose dehydrogenase, and L-galactono-y -lactone dehydrogenase; and (b) recovering 
I^ascorbic acid or esters thereof The various enzymes in this list represent the enzymes 
involved in the vitamin C biosynthetic pathway in plants. It is uncertain at this time 
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whether the enzyme represented by GDP-L-galactose phosphorylase is actually a 
phosphorylase or a pyrophosphoiylase (i.e., GDP-L-galactose pyrophosphorylase). 
Therefore, use of the term "GDP-L-galactose phosphorylase" herein refers to either GDP- 
L^galactose phosphorylase or QDP-L-galactose pyrophosphorylase. In one aspect of the 
5 invention, this method includes the step of culturing in a fermentation medium a 
microorganism having a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. This aspect of the present 
invention is discussed in detail below. 

Another embodiment of the present invention relates to a genetically modified 

10 microorganism for producing L-ascorbic acid or esters thereof Another embodiment of 
the present invention relates to a genetically modified plant for producing L-ascorbic acid 
or esters thereof Both genetically modified microorganisms (e.g., bacteria, yeast, 
microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to 
increase the action of an enzyme selected fi-om the group of hexokinase, glucose 

15 pho^hate isomerase, phosphomannose isom^ase, phosphomannomutase, GDP-mannose 
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, and/or L- 
galactono-y-lactone dehydrogenase. In a preferred embodiment, both genetically 
modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants, 

20 microalgae) have a genetic modification to increase the action of an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the 
genetic modification includes the transformation of the microorganism or plant with the 
epimerase as described above. 

To produce significantly high yields of L-ascorbic acid by the method of the 

25 present invention, a plant and/or microorganism is genetically modified to enhance 
production of L-ascorbic acid. As used herein, a genetically modified plant (such as a 
higher plant or microalgae) or microorganism, such as a microalga {Prototheca, 
ChIorel!a\ Escherichia coli, or a yeast, is modified (i.e., mutated or changed) within its 
genome and/or by recombinant technology (i.e., genetic engineering) fi*om its normal (i.e., 

30 wild-type or naturally occurring) form. In a preferred embodiment, a genetically modified 
plant or microorganism according to the present invention has been modified by 
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recombinant technology. Genetic modification of a plant or microorganism can be 
accomplished using classical strain development and/or molecular genetic techniques, 
include genetic engineering techniques. Such techniques are generally disclosed herein 
and are additionally disclosed, for example, in Sambrook et al., 1989, Molecular Cloning: 
5 A Laboratory Manual^ Cold Spring Harbor Labs Press; Roessler, 1995, Plant Lipid 
Metabolism^ pp. 46-48; and Roessler et al., 1994, in Bioconversion for Fuels, Himmel et 
al. eds., American Chemical Society, Washington D.C., pp 255-70). These references are 
incorporated by reference herein in their entirety. 

In some embodiments, a genetically modified plant or microorganism can include 

10 a natural genetic variant as well as a plant or microorganism in which nucleic acid 
molecules have been inserted, deleted or modified, including by mutation of endogenous 
genes (e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such 
a manner that the modifications provide the desired effect within the plant or 
microorganism. As discussed above, a genetically modified plant or microorganism 

1 5 includes a plant or microorganism that has been modified using recombinant technology. 

As used herein, genetic modifications which result in a decrease in gene 
expression, an increase in inhibition of gene expression or inhibition of a gene product 
(i.e., the protein encoded by the gene), a decrease in the fimction of the gene, or a 
decrease in the fimction of the gene product can be referred to as inactivation (complete 

20 or partial), deletion, interruption, blockage, down-regulation, or decreased action of a 
gene. For example, a genetic modification in a gene which results in a decrease in the 
Sanction of the protein encoded by such gene can be the result of a complete deletion of 
the gene encoding the protein (i.e., the gene does not exist, and therefore the protein does 
not exist), a mutation in the gene encoding the protein which results in incomplete or no 

25 translation of the protein (e.g., the protein is not expressed), or a mutation in the gene 
which decreases or abolishes the natural fimction of the protein (e.g., a protein is 
expressed which has decreased or no enzymatic activity). 

Genetic modifications which result in an increase in gene expression or fimction 
can be referred to as amplification, overproduction, overexpression, activation, 

30 enhancement, addition, ui>-regulation or increased action of a gene. Additionally, a genetic 
modification to a gene which modifies the expression, fimction, or activity of the gene can 
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have an impact on the action of other genes and their expression products within a given 
metabolic pathway (e.g., by inhibition or competition). In this embodiment, the action 
(e.g., activity) of a particular gene and/or its product can be afifected (i.e., upregulated or 
downregulated) by a genetic modification to another gene within the same metabolic 
5 pathway, or to a gene within a different metabolic pathway which impacts the pathway of 
interest by competition, inhibition, substrate formation, etc. 

In general, a plant or microorganism having a genetic modification that aflFects L- 
ascorbic add production has at least one genetic modification, as discussed above, which 
results in a change in the L-ascorbic acid production pathway as compared to a wild-type 

10 plant or microorganism grown or cultured under the same conditions. Such a 
modification in an L^ascorbic acid production pathivay changes the ability of the plant or 
microorganism to produce L-ascorbic acid. According to the present invention, a 
genetically modified plant or microorganism preferably has an enhanced ability to produce 
L-ascorbic add compared to a wild-type plant or microorganism cultured under the same 

15 conditions. 

The present invention is based on the present inventors' discovery of the 
biosynthctic pathway for I^ascorbic add (vitamin C) in plants and microorganisms. Prior 
to the present invention, the metabolic pathway by which plants produce L-ascorbic add, 
was not completely eluddated. The present inventors have demonstrated that L-ascorbic 

20 acid production in plants, including L-ascorbic acid-producing microorganisms (e.g., 
microalgae), is a pathway which uses GDP-D-mannose and involves sugar phosphates and 
NDP-sugars. In addition, the present inventors have made the surprising discovery that 
bothL-galactose and I^galactono-y-lactone can be rapidly converted into L-ascorbic add 
in L-ascorbic acid-producing microalgae, including Prototheca and CMorella 

25 pyrertoidosa. The entire pathway for L-ascorbic add production in plants is set forth in 
Figs. 1 A-IC. More particularly, Fig. 1 A shows that the production of L-ascorbic add in 
plants proceeds through the production of mannose intermediates to GDP-D-mannose, 
followed by the conversion of GDP-D-mannose to GDP-L-galactose by GDP-D- 
mannose:GDP-L-galactose epimerase (also known as GDP-D-mannose-3,5-epimerase) 

30 (Fig. IB), and then by the subsequent progression to L-galactose-l-P, L-galactose, L- 
galactonic add (optional), L-galactono-y-Iactone, and L-ascorbic acid (Fig. IC). Fig. IB 
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also illustrates alternate pathways for the use of various intermediates, such as GDP-D- 
mannose. Certain aspects of this pathway have been independently described in a 
publication (Wheeler, et al., 199S, Nature 393:365-369), incorporated herein by reference 
in its entirety. 

5 Points within the L-ascorbic add production pathway which can be targeted by 

genetic modification to affect the production of L-ascorbic acid can generally be 
catagorized into at least one of the following pathways: (a) pathways afifecting the 
production of GDP-D-mannose (e.g., pathways for converting a carbon source into GDP- 
D-mannose); (b) pathways for converting GDP-D-mannose into other compounds, (c) 
10 pathways associated with or downstream of the action of GDP-D-mannose: GDP-L- 
galactose epimerase, (d) pathways which compete for substrates involved in the 
production of any of the intermediates within the I^ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1 -phosphate, L- 
galactose, L-galactona-y-lactone, and/or L-ascorbic acid; and (e) pathways which inhibit 
1 5 production of any of the intermediates within the L-ascorbic acid production pathway, and 
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-l-phosphate, L- 
galactose, L-galactono-y -lactone, and/or L-ascorbic acid. 

A genetically modified plant or microorganism useful in a method of the present 
invention typically has at least one genetic modification in the L-ascorbic acid production 
pathway which results in an enhanced production of l-ascorbic acid. In one embodiment, 
a genetically modified plant or microorganism has at least one genetic modification that 
results in: (a) an enhanced production of GDP-D-mannose; (b) an inhibition of pathways 
which convert GDP-D-mannose into compounds other than GDP-L-galactose; (c) an 
enhancement of action of the GDP-D-mannose: GDP-L-galactose epimerase; (d) an 
enhancement of the action of enzymes downstream of the GDP-D-mannose: GDP-L- 
galactose epimerase; (e) an inhibition of pathways which compete for substrates involved 
in the production of any of the intermediates within the I^ascorbic acid production 
pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1- 
phosphate, L-galactose, L-galactono-y -lactone, and/or L-ascorbic acid; and (e) an 
inhibition of pathways which inhibit production of any of the intermediates within the L- 
ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L- 
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galactose, L-galactose-1 -phosphate, L-galactose, L-galactono-y -lactone, and/or L- 
ascorbic acid. 

An enhanced production of GDP-D-mannose by genetic modification of the plant 
or microorganism can be achieved by, for example, overexpression of enzymes such as 
5 hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI), 
phosphomannomutase (PMM) and/or GDP-D-mannose pyrophosphorylase (GMP). 
Inhibition of pathways which convert GDP-D-mannose to compounds other than GDP-L- 
galactose can be achieved, for example, by modifications which inhibit polysaccharide 
synthesis, GDP-D-rhamnose synthesis, GDP-L-fiicose synthesis and/or GDP-D- 

10 mannuronic acid synthesis. An increase in the action of the GDP-D-mannose: GDP-L- 
galactose epimerase and of enzymes downstream of the epimcrase in the L-ascorbic acid 
production pathway can be achieved by genetic modifications which include, but are not 
limited to: overexpression of the epimerase gene (i.e, by overexpression of a recombinant 
nucleic acid molecule encoding the epimerase gene or a homologue thereof (discussed in 

15 detail below), and/or by mutation of the endogenous or recombinant gene to enhance 
expression of the gene) and/or overexpression of genes downstream of the epimerase 
which encode subsequent en2ymes in the L-ascorbic acid pathway. Finally, metabolic 
pathways which compete with or inhibit the L-ascorbic acid production pathway can be 
inhibited by deleting or mutating enzymes, substrates or products which either inhibit or 

20 compete for an enzyme, substrate or product in the L-ascorbic add pathway. 

As discussed above, a genetically modified plant or microorganism usefiil in the 
method of the present invention can have at least one genetic modification (e.g., mutation 
in the endogenous gene or addition of a recombinant gene) in a gene encoding an enzyme 
involved in the L-ascorbic acid production pathway. Such genetic modifications 

25 preferably increase (i.e., enhance) the action of such enzymes such that L-ascorbic acid 
is preferentially produced as compared to other possible end products in related metabolic 
pathways. Such genetic modifications include, but are not limited to, overexpression of 
the gene encoding such enzyme, and deletion, mutation, or downregulation of genes 
encoding competitors or inhibitors of such enzyme. Preferred enzymes for which the 

30 action of the gene encoding such enzyme can be genetically modified include: hexokinase, 
glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase 
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(PMM), GDP-D-mannose pyrophosphorylase (GMP), GDP-D-mannose:GDP-L-galactose 
epimcrase, GDP-L-galactose phosphorylase, L-galadose-l-P-phosphatase, L-galactose 
dehydrogenase, and/or I^gakctono-y-lactone dehydrogenase. More preferably, a 
genetically modified plant or niicroorganism useful in the present invention has a genetic 
5 modification which increases the action of an enzyme selected firom the group of GDP-D- 
mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose- 1-P- 
phosphatase, L-galactose dehydrogenase, and/or L-galactono-y -lactone dehydrogenase. 
Even more preferably, a genetically modified plant or microorganism useful in the present 
invention has a genetic modification which increases the action of GDP-D-mannose:GDP- 
10 L-galactose epimerase. These enzymes and the reactions catalyzed by such enzymes are 
illustrated in Figs. lA-lC. 

Prior to the present invention, without knowing the L-ascorbic acid biosynthetic 
(i.e., production) pathway, previous mutagenesis and screening efforts were limited in that 
only non-lethal mutations could be detected. One embodiment of the present invention 
relates to elimination of a key competing enzyme that diverts carbon flow firom L-ascorbic 
add synthesis. If such enzyme is absolutely required for growth on glucose, then mutants 
lacking the enzyme (and, therefore, having increased carbon flow to L-ascorbic acid) 
would have been nonviable and not have been detected during prior screening efforts. 
One such enzyme is phosphofi^ctokinase (PFK) (See Fig. 2A). PFK is required for 
growth on glucose, and is the major step drawing carbon away firom L-ascorbic acid 
biosynthesis (Fig. 2A). Elimination of PFK would render the cells nonviable on glucose- 
based media. Selection of a conditional mutant where PFK was inactivated by temperature 
shift, for example, may allow development of a L-ascorbic acid process where cell growth 
is achieved under permissive fermentation conditions, and L-ascorbic acid production 
(from glucose) is initiated by a shift to non-permissive condition. In this example, the 
temperature shift would eliminate carlx)n flow fi-om glucose to glycolysis via PFK, thereby 
shunting carbon into the L-ascorbic acid branch of metabolism. This approach has 
application not only in natural L-ascorbic acid producing organisms, but also in L-ascorbic 
acid recombinant systems (genetically engineered plant or microorganisms) as discussed 
herein. 
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Knowing the identity and mechanism of the rate-limiting pathway enzymes in the 
L-ascorbic add production pathway allows for design of specific inhibitors of the enzymes 
that are also growth inhibitory. Selection of mutants resistant to the inhibitors allows for 
the isolation of strains that contain L-ascorbic add-pathway enzymes with more favorable 
5 kinetic properties. Therefore, one embodiment of the present invention is to identify 
inhibitors of the enzymes that are also growth inhibitory. These inhibitors are then used 
to select genetic mutants that overcome this inhibition and produce L-ascorbic acid at high 
levels. In this embodiment, the resultant plant or microorganism is a non-recombinant 
strain which can then be further modified by recombinant technology, if desired. In 

10 recombinant L-ascorbic add producing strains, random mutagenesis and screening can be 
used as a final step to increase L-ascorbic acid production. 

In yet another embodiment genetic modifications are made to an L-ascorbic add 
produdng organism directly. This allows one to build upon a base of data acquired during 
prior classical strain improvement eflforts, and perhaps more importantly, allows one to 

15 take advantage of undefined beneficial mutations that occurred during classical strain 
improvement. Furthermore, fewer problems are encountered when expressing native, 
rather than heterologous, genes. The most advanced system for development of genetic 
systems for microalgae has been developed for Chlamydomonas reinhardtii. Preferably, 
development of such a genetically modified production organism would include: isolation 

20 of rautant(s) with a specific nutritional requirement for use with a cloned selectable marker 
gene (similar to the uraS mutants used in yeast and fiingal systems); a cloned selectable 
marker such as URA3 or alternatively, identification and cloning of a gene that specifies 
resistance to a toxic compound (this would be analogous to the use of zmtibiotic resistance 
genes in bacterial systems, and, as is the case in yeast and other fungi, a means of 

25 inserting/removing the marker gene repeatedly would be required, unless several different 
selectable markers were developed); a transformation system for introducing DNA into 
the production organism and achieving stable transformation and expression; and, a 
promoter system (preferably several) for high-level expression of cloned genes in the 
organism. 

30 Another embodiment of the present invention, discussed in detail below, is to place 

key genes or allelic variants and homologues thereof firom L-ascorbic acid producing 
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organisms (I.e., higher plants and microalgae) into a plant or microorganism that is more 
amenable to molecular genetic manipulation, including endogenous L-ascorbic acid 
producing microorganisms and suitable plants. For example, it is possible to identify a 
suitable non-pathogenic organism based on the requirement of growth (on glucose) at low 
5 pH (i.e., acid-tolerant organisms, discussed in detail below). 

One suitable candidate for recombinant production in any suitable host organism 
is the gene (nucleic acid molecule) encoding GDP-D-mannose:GDP-L-galactose 
epimerase and homologues of the GDP-D-mannose:GDP-L-galactose epimerase, as well 
as any other epimerase that has structural homology at the primary (i.e., sequence) or 

10 tertiary (i.e., three dimensional) level, to a GDP-4-k:eto-6-deoxy-D-mannose epimerase/ 
reductase, or to a UDP-galactose 4-epimerase. Many microorganisms produce GDP-D- 
mannose as a precursor to exopolysaccharide and glycoprotein production, even though 
such organisms may not make L-ascorbic acid. This aspect of the present invention is 
discussed in detail below. 

1 5 Referring to Figs. 1 A-IC, at least some of the enzymes from glucose-6-phosphate 

to GDP-D-mannose are present in many organisms. In feet, the entire sequence is present 
in bacteria such as Azotobacter vinelandii and Psendomonas aeruginosa, and make up the 
early steps in the biosynthesis of the exopolysaccharide alginate. In this regard, it is 
possible that the only thing preventing these organisms from producing L-ascorbic acid 

20 could be the lack of GDP-D-mannose:GDP-L-galactose epimerase. The presence of PMI, 
PMM and GMP (see Fig. 1 A) in so many organisms is important for two reasons. First, 
these organisms themselves could serve as alternate hosts for L-ascorbic acid production, 
by building on the existing eariy pathway enzymes and adding the required cloned genes 
(the epimerase and possibly others). Second, the genes encoding PMI, PMM and GMP 

25 can be cloned into a new organism where, together with the cloned epimerase, they would 
encode the overall pathway from glucose-6-phosphate to GDP-L- galactose. 

In order to screen genomic DNA or cDNA libraries from different organisms and 
to isolate nucleic acid molecules encoding these enzymes such as the GDP-D- 
mannose:GDP-L-galactose epimerase, one can use any of a variety of standard molecular 

30 and biochemical techniques. For example, the GDP-D-marmose:GDP-L-galactose 
epimerase can be purified from an organism such as Prototheca, the N-terminal amino 



wo 99/64618 PCT/US99/11576 

17 

add sequence can be determined ^eluding, if necessary, the sequence of internal peptide 
finagments), and this information can be used to design degenerate primers for amplifying 
a gene fragment from the organism's DNA. This fragment would then be used to probe 
the library, and subsequently frrigments that hybridize to the probe would be cloned in that 
5 organism or another suitable production organism. There is ample precedent for plant 
enzymes being expressed in an active form in bacteria, such as E. coU, Akematively, 
yeast are also a suitable candidate for developing a heterologous system for L-ascorfoic 
acid productioa 

It is to be understood that the present invention discloses a method comprising the 

10 use of a microorganism with an ability to produce commercially useful amounts of L- 
ascorbic acid in a fermentation process (i.e., preferably an enhanced ability to produce L- 
ascorbic acid compared to a wild-type microorganism cultured under the same 
conditions). This method is achieved by the genetic modification of one or more genes 
encoding a protein involved in an I^ascorbic add pathway which resuks in the production 

15 (expression) of a protein having an altered (e.g., increased or decreased) fiinction as 
compared to the corresponding wild-type protein. Preferably, such genetic modification 
is achieved by recombinant technology. It will be appredated by those of skill in the art 
that production of genetically modified plants or microorganisms having a particular 
ahered function as described dsewhere herdn (e.g., an enhanced ability to produce GDP- 

20 D-mannose:GDP-L-galactose epimerase), such as by transformation of the plant or 
microorganism with a nucleic acid molecule which encodes a particular enzyme, can 
produce many organisms meeting the given fiinctional requirement, albeit by virtue of a 
variety of different genetic modifications. For example, different random nucleotide 
deletions and/or substitutions in a given nucleic acid sequence may all give rise to the 

25 same phenotypic result (e.g., decreased enzymatic activity of the protein encoded by the 
sequence). The present invention contemplates any such genetic modification which 
results in the production of a plant or microorganism having the characteristics set forth 
herein. 

A microorganism to be used in the fermentation method of the present invention 
30 is preferably a bacterium, a fungus, or a microalga which has been genetically modified 
according to the disclosure above. More preferably, a microorganism useful in the present 
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invention is a microalga which is capable of producing L-ascorbic acid, ahhough the 
present invention includes microorganisms which are genetically engineered to produce 
L-ascorbic acid using the knowledge of the key components of the pathway and the 
guidance provided herein. Even more preferably, a microorganism useful in the present 
5 invention is an add-tolerant microorganism, such as microalgae of the genera Prototheca 
and Chlorella. Add-tolerant yeast and bacteria are also known in the art. Acid-tolerant 
microorganisms are discussed in detail below. Particularly prefm-ed microalgae include 
microalgae of the genera, Proiotheca and Chlorella^ with Prototheca being most 
preferred. All known species of Prototheca produce L-ascorbic acid. Production of 

10 ascorbic add by microalgae of the genera Prototheca and Chlorella is described in detail 
in U.S. Patent No. 5,792,631, issued August 11, 1998, andmU.S. Patent No. 5,900,370, 
issued May 4, 1999, both of which are incorporated herein by reference in their entirety. 
Preferred bacteria for use in the present invention include, but are not limited to, 
Azotobcu:ter^ Pseudomonas^ and Escherichia^ although acid-tolerant bacteria are more 

15 preferred. Preferred fungi for use in the present invention include yeast, and more 
preferably, yeast of the genus, Saccharomyces. A microorganism for use in the 
fermentation method of the present invention can also be referred to as a production 
organism. According to the present invention, microalgae can be referred to herein either 
as microorganisms or as plants. 

20 A preferred plant to genetically modify according to the present invention is 

preferably a plant suitable for consumption by animals, including humans. More 
preferably, such a plant is a plant that naturally produces L-ascorbic acid, although other 
plants can be genetically modified to produce L-ascorbic add using the guidance provided 
herein. 

25 The L-ascorbic acid production pathways of the microalgae Prototheca and 

Chlorella pyrenoidosa will be addressed as specific embodiments of the present invention 
are described below. It will be appreciated that other plants and, in particular, other 
microorganisms, have similar L-ascorbic acid pathways and genes and proteins having 
similar structure and function within such pathways. It will also be appreciated that plants 

30 and microorganisms which do not naturally produce L-ascorbic acid can be modified 
according to the present invention to produce L-ascorbic acid. As such, the principles 
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discussed below with regard to Protoiheca and Chlorella pyrenoidosa are applicable to 
other plants and microorganisms, including genetically modified plants and 
microorganisms. 

In one embodiment of the present invention, the action of an enzyme in the L- 
5 ascorbic acid production pathway is increased by amplification of the expression (i.e., 
overcxpression) of an enzyme in the pathway, and particularly, the GDP-D- 
mannose:GDP-L-galactose epimerase, homologues of the epimerase, and/or enzymes 
downstream of the epimerase. Overcxpression of an enzyme can be accomplished, for 
example, by introduction of a recombinant nucleic acid molecule encoding the enzyme. 

10 It is preferred that the gene encoding an enzyme in the L-ascorbic add production 
pathway be cloned under control of an artificial promoter. The promoter can be any 
suitable promoter that will provide a level of enzyme expression required to maintain a 
sufficient level of L-ascorbic acid in the production organism. Preferred promoters are 
constitutive (rather than inducible) promoters, since the need for addition of expensive 

1 5 inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic 
acid molecule according to the present invention can be varied according to the 
requirements for maximum product formation. In one embodiment, the recombinant 
nucleic acid molecule encoding a gene in the L-ascorbic acid production pathway is 
integrated into the chromosomes of the microorganism. 

20 It is another embodiment of the present invention to provide a microorganism 

having one or more enzymes in the L-ascorbic acid production pathway with improved 
affinity for its substrates. An enzyme with improved afiBnity for its substrates can be 
produced by any suitable method of genetic modification or protein engineering. For 
example, computer-based protein engineering can be used to design an epimerase protein 

25 with greater stability and better aflSnity for its substrate. See for example, Maulik et al., 
1997, Molecular Biotechnology: Therapeutic Applications and Strategies^ Wiley-Liss, 
Inc., which is incorporated herein by reference in its entirety. 

Recombinant nucleic acid molecules encoding proteins in the L-ascorbic acid 
production pathway can be modified to enhance or reduce the fijnction (i.e., activity) of 

30 the protein, as desired to increase L-ascorbic acid production, by any suitable method of 
genetic modification. For example, a recombinant nucleic acid molecule encoding an 
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enzyme can be modified by any method for inserting, deleting, and/or substituting 
nucleotides, such as by error-prone PCR. In this method, the gene is amplified under 
conditions that lead to a high fi^equcncy of misincorporation errors by the DNA 
polymerase used for the amplification. As a result, a high fi-equency of mutations are 
5 obtained in the PCR products. The resulting gene mutants can then be screened for 
enhanced substrate aflSnity, enhanced enzymatic activity, or reduced^creased inhibitoiy 
ability by testing the mutant genes for the ability to confer increased L-ascorbic acid 
production onto a test microorganism, as compared to a microorganism carrying the non- 
mutated recombinant nucleic acid molecule. 
1 0 Another embodiment of the present invention includes a microorganism in which 

competitive side reactions are blocked, induding all reactions for w^hich GDP-D-mannose 
is a substrate other than the production of L-ascorbic acid. In a preferred embodiment, 
a microorganism having complete or partial inactivation (decrease in the action of) of 
genes encoding enzymes which compete with the GDP-D-mannose: GDP-L-galactose 
epimerase for the GDP-D-mannose substrate is provided. Such enzymes include GDP-D- 
mannase and/or GDP-D-maraiose-dehydrogenase. As used herein, inactivation of a gene 
can refer to any modification of a gene which results in a decrease in the activity 0.e., 
expression or fiinction) of such a gene, including attenuation of activity or complete 
deletion of activity. 

As discussed above, a particularly preferred aspect of the method to produce L- 
ascorbic acid by fermentation of a genetically modified microorganism of the present 
invention includes the step of cuhuring in a fermentation medium a microorganism having 
a genetic modification to increase the action of an epimerase that catalyzes conversion of 
GDP-D-mannose to GDP-L-galactose. According to the present invention, such an 
epimerase can include the endogenous GDP-D-mannose:GDP-I^galactose epimerase from 
the L-ascorbic acid pathway, described above, as well as any other epimerase that has 
structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional) 
level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or to a UDP-galactose 
4-epimerase. Such structural homology is discussed in detail below. Preferably, such an 
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L- 
galactose. In one embodiment, the genetic modification includes transformation of the 
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microorganism with a recombinant nucleic add molecule that expresses such an 
epimerase. 

Therefore, the epimerase encompassed in the method and organisms of the 
present invention includes the endogenous epimerase which operates in the naturally 
5 occurring ascorbic acid biosynthetic pathway (referred to herein as GDP-D- 
niannose;GDP-L-galactose epimerase), GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductases, and any other epimerase which is capable of catalyzing the conversion of 
GDP-D mannose to GDP-L-galactose and which is structurally homologous to a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase. An 

1 0 epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according 
the present invention can be identified by biochemical and fiinctional characteristics as well 
as structural characteristics. For example, an epimerase according to the present invention 
is capable of acting on GDP-D-mannose as a substrate, and more particularly, such an 
epimerase is capable of catalyzing the conversion of GDP-D-maimose to GDP-L- 

15 galactose. It is to be understood that such capabilities need not necessarily be the normal 
or natural iimction of the epimerase as it acts in its endogenous (i.e., natural) environment. 
For example, GDP-4-keto-6-deoxy-D-mannose epimerase/reductase in its natural 
environment under normal conditions, catalyzes the conversion of GDP-D-mannose to 
GDP-L-fucose and does not act directly on GDP-D-mannose (See Fig. 8 A, B), however, 

20 such an epimerase is encompassed by the present invention for use in catalyzing the 
conversion of GDP-D-mannose to GDP-I^galactose for production of ascorbic acid, to 
the extent that it is capable of^ or can be modified to be capable of, catalyzing the 
conversion of GDP-D-mannose to GDP-L-galactose. Therefore, the present invention 
includes epimerases which have the desired enzyme activity for use in production of 

25 ascorbic acid, are capable of having such desired enzyme activity, and/or are capable of 
being modified or induced to have such desired enzyme activity. 

In one embodiment, an epimerase according to the present invention includes an 
epimerase that catalyzes the reaction depicted in Fig. 7, In another embodiment, an 
epimerase according to the present invention includes an epimerase that catalyzes the first 

30 of the reactions depicted in Fig. 8B. In one embodiment, an epimerase according to the 
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present invention binds to NADPH, In another embodiment, an epimerase according to 
the present invention is NADPH-dependent for enzyme activity. 

As discussed above, the present inventors have discovered that a key enzyme in 
L-ascorbic acid biosynthesis in plants and microorganisms is GDP-D-mannose:GDP-L- 
5 galactose epimerase (refer to Figs. lA-lC). One embodiment of the invention described 
herein is directed to the manipulation of this enzyme and structural homologues of this 
enzyme to increase L-ascorbic acid production in genetically engineered plants and/or 
microorganisms. More particularly, the GDP-D-mannoseiGDP-L-galactose epimerase of 
the Lr-ascorbic add pathway and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases 

10 are believed to be structurally homologous at both the sequence and tertiary structure 
level; a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is believed to be capable 
of functioning in the L-ascorbic add biosynthetic pathway; and a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase or homologue thereof may be superior to a GDP-D- 
mannose-GDP-I^galactose epimerase for increasing L-ascorbic acid production in 

15 genetically engineered plants and/or microorganisms. Furthermore, the present inventors 
disclose the use of a nucleotide sequence encoding all or part of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase as a probe to identify the gene encoding GDP-D- 
mannose:GDP-L-galactose epimerase. Similarly, the present inventors disclose the use 
of a nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose 

20 epimerase/reductase to design oligonucleotide primers for use in a PCR-based strategy for 
identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose epimerase. 

Without being bound by theory, the present inventors believe that the following 
evidence supports the novel concept that the GDP-D-maimose:GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases have significant 

25 structural homology at the level of sequence and/or tertiary structure, and that the GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductases and/or homologues thereof would be 
useful for production of ascorbic acid and/or for isolating the endogenous GDP-D- 
mannose:GDP-L-galactose epimerase. 

Although prior to the present invention, it was not known that the GDP-D- 

30 mannose:GDP-L-galactose epimerase enzyme (also known as GDP-D-mannose-3,5- 
epimerase) plays a critical role in L-ascorbic add biosynthesis, this enzyme was previously 
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described to catalyze the overall reverable reaction between GDP-D-mannose and GDP- 
I^galactose (Barber, \91\,Arch, Biochem. Biophys. 147:619-623; Barber, \97S,Arch, 
Biochem, Biophys. 167:718-722; Barber, 1979, J. BioL Chem, 254:7600-7603; Hebda, 
et al., 1979, Arch. Biochem. Biophys. 194:496-502; Barber and Hcbda, 1982, Meth. 
5 EnzymoL, 83:522-525). Despite these studies, GDP-D-mannose: GDP-L-galactose 
epimerase has never been well characterized nor has the gene aicoding this enzyme been 
cloned and sequenced. Since the original work by Barber, GDP-D-mannose: GDP-L- 
galactose epimerase activity has been detected in the colorless microalga Proiotheca 
ntoriformis by the assignee of the present ^)plication, and in Arabidopsis ihaliana and pea 

10 embryonic axes (Wheeler, et al., 1998, ibid!). 

Barber (1979, J. Biol Chem, 254:7600-7603) proposed a mechanism for GDP-D- 
mannose:GDP-I^galactose epimerase partially purified fi-om the green microalga 
Chlorellapyrenoidosa, The overall conversion of GDP-D-mannose to GDP-L-galactose 
was proposed to proceed by oxidation of the hexosyl moiety at C-4 to a keto intermediate, 

15 ene-diol formation, and inversion of the configurations at C-3 and C-5 upon rehydration 
of the double bonds and stereospecific reduction of the keto group. The proposed 
mechanism is depicted in Fig. 7. 

Based on Barber's work, Feingold and Avigad (1980, In The Biochemistry of 
Plants, Vol. 3: Carbohydrates; Stmcture and Function, P.K. Stompf and E.E. Conn, eds., 

20 Academic Press, NY) elaborated further on the proposed mechanism for GDP-D- 
mannose: GDP-L-galactose epimerase. This mechanism is based on the assumption that 
the epimerase contains tightly bound NAD ', and transfer of a hydride ion fi-om C-4 of the 
substrate (GDP-D-mannose) to enzyme-associated NAD"^ converts the enzyme to the 
reduced (NADH)form, generating enzyme-bound GDP-4-keto-D-mannose. The latter 

25 would then undergo epimerization by an ene-diol mechanism. The final product (GDP-L- 
galactose) would be released fi-om the enzyme after stereospecific transfer of the hydride 
ion originally removed fi-om C-4, simultaneously regenerating the oxidized form of the 
enzyme. 

L-fiicose (6-deoxy-L-galactose) is a component of bacterial lipopolysaccharides, 
30 mammalian and plant glycoproteins and polysaccharides of plant cell walls. L-fiicose is 
synthesized de novo from GDP-D-mannose by the sequential action of GDP-D-mannose- 
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4,6-dehydratase (an NAD(P)-dependent enzyme), and a bifiinctional GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase (NADPH-dependent), also referred to in scientific 
literature as GDP-fiicose synthetase (Rizzi, et al., 1998, Structure 6:1453-1465; Somers, 
et al., 1998, Structure 6: 1601- 1612). This pathway for L-fiicosc biosynthesis appears to 
5 be ubiquitous (Rizzi, et al., 1998, Structure 6: 1453-1465). The mechanisms for GDP-D- 
mannose-4,6-dchydratase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase are 
shown in Fig. 8A, B (Chang, et al., 1988, J. Biol Chem. 263:1693-1697; Barber, 1980, 
Plant Physiol 66:326-329). 

Comparison of Figs. 7 and 8 A, B reveals that Barber's proposed mechanism for 

1 0 GDP-D-mannose:GDP-L-galactose epimerase is analogous to the reaction mechanism for 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The same mechanism has also 
been demonstrated for the epimerization reaction that occurs in the biosynthesis of two 
TDP-6-deoxy hexoses, TDP-L-rhamnose and TDP-6-deoxy-L-talose, from TDP-D- 
glucose (Liu and Thorson, 1994, Ann, Rev. Microbiol 48:223-256). In the latter cases, 

1 5 however, the final reduction at C-4 is catalyzed by NADPH-dependent reductases that are 
separate from the epimerase enzyme. These reductases have opposite stereospecifidty, 
providing either TDP-L-rhamnose or TDP-6-deoxy-L-talose (Liu and Thorson, 1994, 
Ann. Rev, Microbiol 48:223-256). 

In all of the mechanisms described above, NAD(P)H is required for the final 

20 reduction at C-4 (refer to Fig. 8B). In the work of Hebda, et al. (1979, Arch Biochem, 
Biophys. 194:496-502), it was reported that GDP-D-mannose:GDP-L-galactose 
epimerase from C. pyrenoidosa did not require NAD, NADP or NADH for activity. 
Strangely, NADPH was not tested. Based on the analogous mechanisms shown in Figs. 
7 and 8A, B, the present inventors believe that it is likely that GDP-D-mannose:GDP-L- 

25 galactose epimerase from C. pyrenoidosa requires NADPH for the final reduction step. 
Why activity was detected in vitro without NADPH addition is not known, but tight 
*binding of NADPH to the enzyme could explain this observation. On the other hand, if 
the proposed mechanism of Feingold and Avigad (1980, in The Biochemistry of Plants, 
Vol. 3, p. 101-170: Carbohydrates; Structure and Function, P.K. Stompf and E.E. Conn, 

30 ed., Acadonic Press, NY) is correct, the reduced enzyme-bound cofector generated in the 
first oxidation step of the epimerase reaction would serve as the source of electrons for 
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the final reduction of the keto group at C-4 back to the alcohol. Thus no addition of 
exogenous reduced co&ctor would be required for activity in vitro. 

Recently, a human gene encoding the bifiinctional GDP-4-keto-6-deoxy-I>- 
mannose epimerase/reductase was cloned and sequenced (Tonetti, et al., 1996, J. Biol. 
5 Chem, 271-27274-27279). This amino add sequence of the human GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase shows significant homology (29% identity) to the E, coli 
GDP-4-keto-6-deoxy-I>mannose epimerase/reductase (Tonetti, et al., 1998, Acta Cryst, 
D54:684-686; Somers, et al., 1998, Structure 6:1601-1612, both of which are 
incorporated herein by reference in their entireties). Tonetti et al, and Somers et al. 

10 additionally disclosed the tertiary (three dimensional) structure of the jE. coli GDP-4-keto- 
6-deoxy-D-mannose epimerase/reductase (also known as GDP-fiicose synthetase), and 
noted significant structural homology with another epimerase, UDP-galactose 4-epimerase 
(GalE). These epimerases also share significant homology at the sequence level. Since 
no gene encoding a GDP-D-mannose:GDP-L-galactose epimerase has been cloned and 

1 5 sequenced, homology with genes encoding GDP-4-kcto-6-deoxy-D-mannose epimerase/ 
reductases or with genes encoding a UDP-galactose 4-epimerase has not been 
demonstrated. However, based on the similarity of the reaction products for GDP-D- 
mannose:CiDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase (i.e., GDP-L-galactose and GDP-6-deoxy-L-galactose [i.e., GDP-L-fucose], 

20 respectively) and the common catalytic mechanisms (Figs. 7 and 8A, B) the present 
inventors believe that the genes encoding the enzymes will have a high degree of sequence 
homology, as well as tertiary structural homology. 

Significant structural homology between GDP-D-mannose:GDP-L-galactose 
epimerase and GDP-4-kcto-6-deoxy-D-mannose epimerase/reductases may allow a GDP- 

25 4-keto-6-<leoxy-D-mannose epimerase/reductase, or a homologue thereof, to fimction in 
the L-ascorbic acid biosynthetic pathway, and a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase could potentially be even better than a GDP-D-mannose-GDP-L- 
galactose epimerase for increasing I^ascorbic acid production in genetically engineered 
plants and/or microorganisms. Furthermore, a nucleotide sequence encoding all or part 

30 of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used as a probe to 
identify the gene encoding GDP-D-mannose:GDP-L-galactose epimerase. likewise, the 
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nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase can be used to design oligonucleotide primers for use in a PCR-based strategy 
for identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose 
epimerase. 

5 The ability to substitute GDP-4-lceto-6-D-mannose epimerase/reductase for GDP- 

D-mannose:GDP-L-gaJax:tose epimerase to enhance L-ascorbic acid biosynthesis in plants 
or microorganisms depends on the ability of GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase to act directly on GDP-D-mannose to form GDP-L-gaiactose. Evidence 
supporting this possibility already exists. Arabidopsis thaliana murJ mutants are defective 

10 in GDP-D-mannose-4,6-dehydratase activity (Bonin, et al., 1997, Proc, Natl Acad. Set 
94:2085-2090). These mutants are thus blocked in GDP-L-fucose biosynthesis, and 
consequently liave less than 2% of the normal amounts of L-fiicose in the primary cell 
walls of aerial portions of the plant (Zablackis, et al., 1996, Science 272:1808-1810). The 
murl mutants are more brittle than wild-type plants, are slightly dwarfed and have an 

1 5 apparently normal life cycle (Zablackis, et al., 272: 1808-1810). When murl mutants are 
grown in the presence of exogenous L-fiicose, the L-fucose content in the plant is restored 
to the wild-type state (Bonin, et al., 1997, Proc. Nail Acad. Set 94:2085-2090). It was 
discovered (Zablackis, et al., 1996, Science 272: 1 808- 1810) that nturl mutants contain, 
in the hemicellulose xyloglucan component of the primary cell wall, L-galactose in place 

20 of the normal L^flicose. L-galactose is not normally found in the xyloglucan component, 
but in nrnrl naitants L-galactose partly replaces the terminal L-fucosyl residue. Bonin, et 
al. (1997, Proc. Nail Acad. Sci. 94:2085-2090) hypothesized that in the absence of a 
functional GDP-D-mannose-4,6-dehydratase in the murl mutants, the GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase normally involved in L-fiicose synthesis may be 

25 able to use GDP-D-mannose directly, forming GDP-L-galactose. Another possibility, 
however, is that the enzymes involved in L-ascorbic acid biosynthesis in A. thaliana are 
responsible for forming GDP-L-galactose in the murl mutant. If this were true, it would 
suggest that in the wild-type plant, some mechanism exists that prevents GDP-L-galactose 
formed in the L-ascorbic acid pathway from entering cell wall biosynthesis and 

30 substituting for (competing with) GDP-L-fiicose for incorporation into the xyloglucan 
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component (since L-galactose is not present in the primary cell wall of the wild-type 
plant). 

Because of the similar reaction mechanisms of Ga3P-D-mannose:GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, and because of the 
5 evidence that GDP-4-keto-6-deoxy-I>-mannose epimerase/reductase can act directly on 
GDP-D-mannose to form GDP-L-galactose, the present inventors believe that genes 
encoding all epimerases and epimerase/reductases that act on GDP-D-mannose have high 
homology. As such, one aspect of the present invention relates to the use of any 
epimerase (and nucleic acid sequences encoding such epimerase) having significant 

1 0 homology (at the primary, secondary and/or tertiary structure level) to a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase or to a UDP-galactose 4-epimerase for the 
purpose of improving the biosynthetic production of L-ascorbic acid. 

Therefore, as described above, one embodiment of the present invention relates 
to a method for producing ascorbic acid or esters thereof in a microorganism, which 

15 includes culturing a microorganism having a genetic modification to increase the action 
of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Also 
included in the present invention are genetically modified microorganisms and plants in 
which the genetic modification increases the action of an epimerase that catalyzes 
conversion of GDP-D-mannose to GDP-L-galactose. 

20 According to the present invention, an increase in the action of the GDP-D- 

mannose:GDP-L-galactose epimerase in the L-ascorbic acid production pathway can be 
achieved by genetic modifications which include, but are not limited to overexpression of 
the GDP-D-mannose:GDP-L^galactose epimerase gene, a homologue of such gene, or of 
any recombinant nucleic acid sequence encoding an epimerase that is homologous in 

25 primary (nucleic acid or amino acid sequence) or tertiary (three dimensional protein) 
structure to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 
4-epimerase, such as by overexpression of a recombinant nucleic acid molecule encoding 
the epimerase gene or a homologue thereof, and/or by mutation of the endogenous or 
recombinant gene to enhance expression of the gene. 

30 According to the present invention, cm epimerase that has a tertiary structure that 

is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
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reductase is an epimerase that has a tertiary structure that substantially conforms to the 
ternary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws 
(Table 12), In another embodiment, an epimerase that has a tertiary structure that is 
5 homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase is an epimerase that has a tertiary structure that substantially conforms to the 
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented 
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code IGFS. 
As used herein, a "tertiary structure" or '^three dimensional structure" of a protein, such 

10 temis being interchangeable, refers to the components and the manner of arrangement of 
the components in three dimensional space to constitute the protein. The use of the term 
"substantially conforms" refers to at least a portion of a tertiary structure of an epimerase 
which is sufficiently spatially similair to at least a portion of a specified three dimensional 
configuration of a particular set of atomic coordinates (e.g., those represented by 

15 Brookhaven Protein Data Bank Accession Code Ibws) to allow the tertiary structure of 
at least said portion of the epimerase to be modeled or calculated (i.e., by molecular 
replacement) using the particular set of atomic coordinates as a basis for estimating the 
atomic coordinates defining the three dimensional configuration of the epimerase. 

More particularly, a tertiary structure that substantially conforms to a given set of 

20 atomic coordinates is a structure having an average root-mean-square deviation (RMSD) 
of less than about 2.5 A, and more preferably, less than about 2 A, and, in increasing 
preference, less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most 
preferably, less than about 0.3 A, over at least about 25% of the Ca positions as 
compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 

25 reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code Ibws. In other embodiments, a structure that substantially conforms to 
a given set of atomic coordinates is a structure wherein such structure has the recited 
average root-mean-square deviation (RMSD) value over at least about 50% of the Ca 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 

30 epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code Ibws, and in another embodiment, such structure has the 
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recited average root-mean-square deviation (RMSD) value over at least about 75% of the 
Ca positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code Ibws, and in another embodiment, such structure has the 
5 recited average root-mean-square deviation (RMSD) value over about 100% of the Ca 
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein 
Data Bank Accession Code Ibws, Methods to calculate RMSD values are well known 
in the art. Various software programs for determining the tertiary structural homology 

10 between one or more proteins are known in the art and are publicly available, such as 
QUANTA (Molecular Simulations Inc.). 

A preferred epimerase that catalyzes conversion of GDP-D-maimose to GDP-L- 
galactose according to the method and genetically modified organisms of the present 
invention includes an epimerase that comprises a substrate binding site having a tertiary 

1 5 structure that substantially conforms to the tertiary structure of the substrate binding site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 
coordinates having Brookhaven Protein Data Bank Accession Code Ibws. Preferably, the 
tertiary structure of the substrate binding site of the epimerase has an average root-mean- 
square deviation (RMSD) of less than about 2.5 A, and more preferably, less than about 

20 2 A, and, in increasing preference, less than about 1.5 A, less than about 1 A, less than 
about 0.5 A, and most preferably, less than about 0.3 A, over at least about 25% of the 
Ca positions as compared to the tertiary structure of the substrate binding site of a GDP- 
4-kcto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates 
having Brookhaven Protein Data Bank Accession Code Ibws. In other embodiments, the 

25 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 50% of the Ca positions 
as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, and in another embodiment, the 

30 tertiary structure of the substrate binding site of the epimerase has the recited average 
root-mean-square deviation (RMSD) value over at least about 75% of the Ca positions 
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as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, and in another embodiment, the 
tertiary structure of the substrate binding site of the epimerase has the recited average 
5 root-mean-square deviation (RMSD) value over about 100% of the Ca positions as 
compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy- 
D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, The tertiary structure of the 
substrate binding site of a GDP-4-keto-6-deoxy-D-marmose epimerase/reductase 

10 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code Ibws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary 
structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code IGFS is discussed in detail in Somers et al., 1998, ibid. 

15 Another preferred epimerase according to the present invention includes an 

epimerase that comprises a catalytic site having a tertiary structure that substantially 
conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase represented by the atomic coordinates having Brookhaven 
Protein Data Bank Accession Code Ibws. Preferably, the tertiary structure of the 

20 catalytic site of the epimerase has an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 
less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A, over at least about 25% of the Ca positions as compared to the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epfanerase/reductase 

25 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code Ibws. In other embodiments, the tertiary structure of the catalytic site of the 
epimerase has the recited average root-mean-square deviation (RMSD) value over at least 
about 50% of the Ca positions as compared to the tertiary structure of the catalytic site 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic 

30 coordinates having Brookhaven Protein Data Bank Accession Code Ibws, and in another 
embodiment, the tertiary structure of the catalytic site of the epimerase has the recited 
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average root-mean-square deviation (RMSD) value over at least about 75% of the Ca 
positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws, and in another embodiment, the 
5 tertiary structure of the catalytic site of the epimerase has the recited average root-mean- 
square deviation (RMSD) value over 100% of the Ca positions as compared to the 
tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-maimose epimerase/ 
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank 
Accession Code Ibws. 

10 In one embodiment, an epimerase encompassed by the present invention includes 

an epimerase that has a catalytic she which includes amino add residues: serine, tyrosine 
and lysine. In a preferred embodiment, the tertiary structure positions of the amino add 
residues serine, tyrosine and lysine substantially conform to the tertiary structure position 
of residues Serl07, Tyrl36 and Lysl40, resi>ectively, as represented by atomic 

15 coordinates in Brookhaven Protein Data Bank Accession Code Ibws. The tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-maimose epimerase/reductase 
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code ibws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary 
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase 

20 represented by the atomic coordinates having Brookhaven Protein Data Bank Accession 
Code IGFS is discussed in detail in Somers et al., 1998, ibid. 

In an even more preferred embodiment, the above definition of "substantially 
conforms" can be further defined to include atoms of amino acid side chains. As used 
herein, the phrase "common amino acid side chains" refers to amino add side chains that 

25 are common to both the structures which substantially conforms to a given set of atomic 
coordinates and the structure that is actually represented by such atomic coordinates. 
Preferably, a tertiary structure that substantially conforms to a given set of atomic 
coordinates is a structure having an average root-mean-square deviation (RMSD) of less 
than about 2.5 A, and more preferably, less than about 2 A, and, in increasing preference, 

30 less than about 1.5 A, less than about 1 A, less than about 0.5 A, and most preferably, less 
than about 0.3 A over at least about 25% of the common amino acid side chains as 
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compared to the tertiary structure represented by the given set of atomic coordinates. In 
another embodiment, a structure that substantially conforms to a given set of atomic 
coordinates is a structure having the redted average root-mean-square deviation (RMSD) 
value over at least about 50% of the common amino acid side chains as compared to the 
5 tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such structure has the recited average root-mean-square deviation (RMSD) 
value over at least about 75% of the common amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates, and in another 
embodiment, such a structure has the recited average root-mean-square deviation 

10 (RMSD) value over 100% of the conrunon amino acid side chains as compared to the 
tertiary structure represented by the given set of atomic coordinates. 

A tertiary structure of an epimerase which substantially conforms to a specified set 
of atomic coordinates can be modeled by a suitable modeling computer program such as 
MODELER (A, Sali and TX. BlundeU, J. Mol Biol, vol. 234:779-815, 1993 as 

15 implemented in the Insight n Homology software package (Insight U (97.0), MSI, San 
Diego)), using information, for example, derived fi-om the following data: (1) the amino 
acid sequence of the epimerase; (2) the amino acid sequence of the related portion(s) of 
the protein represented by the specified set of atomic coordinates having a three 
dimensional configuration; and, (3) the atomic coordinates of the specified three 

20 dimensional configuration. Alternatively, a tertiary structure of an epimerase which 
substantially conforms to a specified set of atomic coordinates can be modeled using data 
generated from analysis of a crystallized structure of the epimerase. A tertiary structure 
of an epimerase vsdiich substantially conforms to a specified set of atomic coordinates can 
also be calculated by a method such as molecular replacement. Methods of molecular 

25 replacement are generally known by those of skill in the art (generally described in 
Brunger. Meih Enzym., vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Meth, 
Enzym.^ vol 276, pp. 581-594, 1997; Tong and Rossmann, A/e//?. Enzym,, vol. 276, pp. 
594-61 1, 1997; and Bentley, Meth, Enzym,, vol. 276, pp. 61 1-619, 1997, each of which 
are incorporated by this reference herein in their entirety) and are performed in a software 

30 program including, for example, XPLOR (Brunger, et al.. Science, vol. 235, p. 458, 
1 987). In addition, a structure can be modeled using techniques generally described by. 
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for example, Sali, Current Opinions in Biotechnology^ vol. 6, pp. 437-451, 1995, and 
algorithms can be implemented in program packages such as Homology 95,0 (in the 
program Insight II, available fromBiosynvMSI, San Diego, CA). Use of Homology 95 .0 
requires an alignment of an amino acid sequence of a known structure having a known 
5 three dimensional structure with an amino acid sequence of a target structure to be 
modeled. The alignment can be a pairwise alignment or a multiple sequence alignment 
including other related sequences (for example, using the method generally described by 
Rost, Meth. Enzymol.^ vol, 266, pp. 525-539, 1996) to improve accuracy. Structurally 
conserved regions can be identified by comparing related structural features, or by 

10 examining the degree of sequence homology between the known structure and the target 
structure. Certain coordinates for the target structure are assigned using known structures 
from the known structure. Coordinates for other regions of the target structure can be 
generated from fragments obtained from known structures such as those found in the 
Protein Data Bank maintained by Brookhaven National Laboratory, Upton, NY. 

15 Conformation of side chains of the target structure can be assigned with reference to what 
is sterically allowable and using a library of rotamers and their frequency of occurrence 
(as generally described in Ponder and Richards, J, MoL Biol, vol. 193, pp. 775-791, 
1987). The resulting model of the target structure, can be refined by molecular mechanics 
(such as embodied in the program Discover, available from Biosym/MSI) to ensure that 

20 the model is chemically and conformationally reasonable. 

According to the present invention, an epimerase that has a nucleic acid sequence 
that is homologous at the primary structure level (i.e., is a homologue of) to a nucleic acid 
sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 
galactose 4-epimerase includes any epimerase encoded by a nucleic acid sequence that is 

25 at least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to a nucleic acid sequence 
encoding a GDP-4-kcto-6-deoxy-I>-mannose epimerase/reductase or a UDP-galactose 4- 
epimerase, and preferably to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. 

30 Similariy, an epimerase that has an amino acid sequence that is homologous to an amino 
acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP- 
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galactose 4-epimerase includes any epimerase having an amino acid sequence that is at 
least about 15%, and preferably at least about 20%, and more preferably at least about 
25%, and even more preferably, at least about 30% identical to an amino acid sequence 
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-gsdactose 4- 
5 epimerase, and preferably to an amino acid sequence selected firom the group consisting 
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO: 10. 

According to one embodiment of the present invention, homology or percent 
identity between two or more nucleic acid or amino acid sequences is performed using 
methods known in the art for aligning and/or calculating percentage identity. To compare 

10 the homology/percent identity between two or more sequences as set forth above, for 
example, a module contained within DNASTAR (DNASTAR, Inc., Madison, Wisconsin) 
can be used. In particular, to calculate the percent identity between two nucleic acid or 
amino acid sequences, the Lipman-Pearson method, provided by the MegAlign module 
within the DNASTAR program, is preferably used, with the following parameters, also 

15 referred to herein as the Lipman-Pearson standard default parameters: 

(1) Ktuple = 2; 

(2) Gap penalty = 4; 

(3) Gap length penalty = 12. 

Using the Lipman-Pearson method with these parameters, for example, the percent 
20 identity between the amino acid sequence for E, coli GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (SEQ ID NO:4) and human GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (FX) (SEQ ID NO:6) is 27.7%, which is comparable to the 27% 
identity described for these enzymes in Tonetti et al., 1998, Ada Crysi. D54:684-686. 
According to another embodiment of the present invention, to align two or more 
25 nucleic acid or amino acid sequences, for example to generate a consensus sequence or 
evaluate the similarity at various positions between such sequences, a CLUSTAL 
alignment program (e.g., CLUSTAL, CLUSTAL V, CLUSTAL W), also available as a 
module within the DNASTAR program, can be used using the following parameters, also 
referred to herein as the CLUSTAL standard default parameters: 
30 Multiple Alignment Parameters Ti.e., for more than 2 sequences) : 

(1) Gap penalty == 10; 
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(2) Gap length penalty = 10; 

Pairwise Alignment Parameters fi e.. for two sequences): 

(1) Ktuple"l; 

(2) Gap penalty = 3; 
5 (3) Window =5; 

(4) Diagonals saved = 5. 

According to the present invention, a GDP-4-keto-6-deo?ty-D-mannose epimerase/ 
reductase can be a GDP-4-keto-6-deoxy-D-niannose epimerase/reductase from any 
organism, including Arabidopsis thaliana, Escherichia coli^ and human. A nucleic add 

10 sequence encoding a GDP-4-k:eto-6-deoxy-D-mannose epimerase/reductase from 
Arabidopsis thaliana is represented herein by SEQ ID NO: 1 . SEQ ID NO: 1 encodes a 
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino add sequence 
represented herein as SEQ ID NO:2. A nucleic acid sequence encoding a GDP-4-keto-6- 
deoxy-D-mannose epimerase/reductase from Escherichia coli is represented herein by 

15 SEQ ID NO:3. SEQ ID NO:3 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase having an amino add sequence represented herein as SEQ ID NO:4. A nudeic 
acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from 
hcmio sapiens is represented herein by SEQ ID NO:5. SEQ ID NO: 5 encodes a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase having an amino add sequence represented 

20 herein as SEQ ID NO:6. 

According to the present invention, a UDP-galactose 4-epimerase can be a UDP- 
galactose 4-epimerase from any organism, including Escherichia coli and human. A 
nucleic acid sequence encoding a UDP-galactose 4-epimerase from Escherichia coli is 
represented herein by SEQ ID NO: 7. SEQ ID NO: 7 encodes a UDP-galactose 4- 

25 epimerase having an amino add sequence represented herein as SEQ ID NO: 8. A nudeic 
add sequence encoding a UDP-galactose 4-epimerase from homo sapiens is represented 
herein by SEQ ID NO:9. SEQ ID NO:9 encodes a UDP-galactose 4-epimerase having 
an amino acid sequence represented herein as SEQ ID NO: 10. 

In a preferred embodiment, an epimerase encompassed by the present invention 

30 has an amino add sequence that aligns with the amino acid sequence of SEQ ID NO: 1 1 , 
for example using a CLUSTAL alignment program, wherein amino acid residues in the 
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amino acid sequence of the epimerase align with 100% identity with at least about 50% 
of non-Xaa residues in SEQ ID NO:l 1, and preferably at least about 75% of non-Xaa 
residues in SEQ ID NO: 1 1, and more preferably, at least about 90% of non-Xaa residues 
in SEQ ID NO: 11, and even more preferably 100% of non-Xaa residues in SEQ ID 
5 NO:l 1. The percent identity of residues aligning with 100% identity with non-Xaa 
residues can be simply calculated by dividing the number of 100% identical matches at 
non-Xaa residues in SEQ ID NO: 1 1 by the total number of non-Xaa residues in SEQ ID 
NOrl 1. A preferred nucleic acid sequence encoding an epimerase encompassed by the 
present invention indude a nucleic add sequence encoding an epimerase having an amino 

10 add sequence with the above described identity to SEQ ID NO: 1 1 . Such an alignment 
using a CLUSTAL alignment program is based on the same parameters as previously 
disclosed herein. SEQ ID NO: 11 represents a consensus amino add sequence of an 
epimerase which was derived by aligning at least portions of amino add sequences SEQ 
ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, as described in Somers et al., 1 998, Structure 

15 6:1 60 1-1612, and can be approximately duplicated using CLUSTAL. 

In another embodiment, an epimerase encompassed by the present invention 
includes an epimerase that has a catalytic site which includes amino acid residues: serine, 
tyrosine and lysine. Preferably, such serine, tyrosine and lysine residues are located at 
positions in the epimerase amino add sequence which align using a CLUSTAL alignment 

20 program with positions Serl05, Tyrl34 and Lysl38 of consensus sequence SEQ ID 
NO:l 1, with positions Serl09, Tyrl38 and Lysl42 of sequence SEQ ID NO:2, with 
positions Serl07, Tyrl36 and Lysl40 of SEQ ID NO:4, with positions Serl 14, Tyrl43 
and Lysl47 of sequence SEQ ID NO:6, with positions Serl24, Tyrl49 and Lysl53 of 
sequence SEQ IDNO:8 or with poshions Serl32, Tyrl57 and Lysl61 of sequence SEQ 

25 ID NO: 10. 

In another embodiment, an epimerase that has an amino add sequence that is 
homologous to an amino acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase includes any epimerase that has an amino acid motif: Gly-Xaa-Xaa- 
GHy-Xaa-Xaa-Gly, which is found, for ocample in positions 8 through 14 of the consensus 
30 sequence SEQ ID NO:l 1, in positions 12 through 18 of SEQ ID N0:2, in positions 10 
through 16 of SEQ IDNO:4, in positions 14 through 20 of SEQ ID NO: 6, in positions 
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7 through 13 of SEQ ID N0:8, and in positions 9 through 15 of SEQ ID NO: 10. Such 
a motif can be identified by its alignment with the same motif in the above-identified amino 
acid sequences using a CLUSTAL alignment program. Preferably, such motif is located 
within the first 25 N-terminal amino acids of the amino acid sequence of the epimerase. 
5 In yet another embodiment, an epimerase encompassed by the present invention 

includes an epimerase that has a substrate binding site which includes amino acid residues 
that align using a CLUSTAL alignment program with at least 50% of amino acid 
positions Asnl77, Serl78, Argl87, Arg209, Lys283, Asnl65, Serl07, Serl08, Cysl09, 
Asnl33, Tyrl36 andHisl79 of SEQ IDNO:4. Alignment with positions Serl07, Tyrl36, 

10 Asnl65, Arg209, is preferably with 100% identity (i.e., exact match of residue, under 
parameters for alignment). 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention comprises at least 4 contiguous amino acid residues having 100% 
identity with at least 4 contiguous amino acid residues of an amino acid sequence selected 

15 fi-om the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ 
ID NO: 10, as determined using a Lipman-Pearson method with Lipman-Pearson standard 
default parameters or by comparing an alignment using a CLUSTAL program with 
CLUSTAL standard default parameters. According to the present invention, the term 
"contiguous" means to be connected in an unbroken sequence. For a first sequence to 

20 have "100% identity^* with a second sequence means that the first sequence exactly 
matches the second sequence with no gaps between nucleotides or amino acids. 

In another embodiment of the present invention, an epimerase encompassed by the 
present invention is encoded by a nucleic acid sequence that comprises at least 12 
contiguous nucleic acid residues having 100% identity with at least 12 contiguous nucleic 

25 acid residues of a nucleic acid sequence selected fi*om the group of SEQ ID NO:l, SEQ 
ID NO:3, SEQ ED NO:5, SEQ ID NO:7 or SEQ ID NO: 10, as determined using a 
Lipman-Pearson method with Lipman-Pearson standard default parameters or by 
comparing an alignment using a CLUSTAL program with CLUSTAL standard default 
parameters. 

30 In another embodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that hybridizes under stringent 
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hybridization conditions to a nucleic acid sequence selected from the group of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5. SEQ ID NO:7 or SEQ ID NO:9. As used herein, 
stringent hybridization conditions refer to standard hybridization conditions under which 
nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard 
5 conditions are disclosed, for example, in Sambrook et al.. Molecular Cloning: A 
Laboratory Manual^ Cold Spring Harbor Labs Press, 1989. Sambrook ct al., ibid.^ is 
incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In 
addition, formulae to calculate the appropriate hybridization and wash conditions to 
achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, 

10 for example, in Meinkoth et al., 1984, AnaL Biochem. 138, 267-284; Meinkoth et al., 
ibid,^ is incorporated by reference herein in its entirety. 

More particularly, stringent hybridization and washing conditions, as referred to 
herein, refer to conditions \\iiich permit isolation of nucleic acid molecules having at least 
about 70% nucleic acid sequence identity with the nucleic add molecule being used to 

15 probe in the hybridization reaction, more particulariy at least about 75%, and most 
particularly at least about 80%. Such conditions will vary, depending on whether 
DNA:RNA or DNAiDNA hybrids are being formed. Calculated melting temperatures for 
DNA:DNA hybrids are 10*'C less than for DNA:RNA hybrids. In particular 
embodiments, stringent hybridization conditions for DNA:DNA hybrids include 

20 hybridization at an ionic strength of 6X SSC (0.9 M Na*) at a temperature of between 
about 20 °C and about 35 °C, more preferably, between about 28 °C and about 40 °C, and 
even more preferably, between about 35**C and about 45**C. In particular embodiments, 
stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic 
strength of 6X SSC (0.9 M Na*) at a temperature of between about 30°C and about 

25 45**C, more preferably, between about 38**C and about SO^C, and even more preferably, 
between about 45 °C and about 55 °C. These values are based on calculations of a melting 
temperature for molecules larger than about 100 nucleotides, 0% formamide and a G + 
C content of about 40%. Alternatively, T„ can be calculated empirically as set forth in 
Sambrook et al., supra^ pages 9.3 1 to 9.62. 

30 In another embodiment of the present invention, an epimerase encompassed by the 

present invention is encoded by a nucleic acid sequence that comprises a nucleic acid 
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sequence selected from the group of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7 or SEQ ID NO:9 or a fragment thereof wherein the fragment encodes a protein 
that is capable of catalyzing the conversion of GDP-D-maxmose to GDP-L-galactose, such 
as under physiological conditions. In another embodiment, an epimerase encompassed by 
5 the present invention comprises an amino acid sequence selected from the group of SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 1 0 or a fragment 
thereof wherdn the fragment is capable of catal3rzing the conversion of GDP-D-mannose 
to GDP-L-galactose. It is to be understood that the nucleic add sequence encoding the 
amino acid sequences identified herein can vary due to degeneracies. As used herein, 

10 nucleotide degeneracies refers to the phenomenon that one amino acid can be encoded by 
difierent nucleotide codons. 

One embodiment of the present invention relates to a method to identify an 
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Preferably, 
such a method is usefiil for identifying the GDP-D-mannose:GDP-L-galactose epimerase 

15 which catalyzes the conversion of GDP-D-mannose to GDP-L-galactose in the 
endogenous (i.e., naturally occurring L-ascorbic acid biosynthetic pathvmy of 
microorganisms and/or plants). Such a method can include the steps of: (a) contacting a 
source of nucleic acid molecules with an oligonucleotide at least about 12 nucleotides in 
length under stringent hybridization conditions, wherein the oligonucleotide is identified 

20 by its ability to hybridize under stringent hybridization conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ 
ID NO:5; and, (b) identifying nucleic acid molecules from the source of nucleic acid 
molecules which hybridize under stringent hybridization conditions to the oligonucleotide. 
Nucleic acid molecules identified by this method can then be isolated from the source 

25 using standard molecular biology techniques. Preferably, the source of nucleic acid 
molecules is obtained from a microoi^ganism or plant that has an ascorbic acid production 
pathway. Such a source of nucleic acid molecules can be any source of nucleic acid 
molecules which can be isolated from an organism and/or which can be screened by 
hybridization with an oligonucleotide such as a probe or a PGR primer. Such sources 

30 include genomic and cDNA libraries and isolated RNA, 
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In order to screen cDNA libraries from different organisms and to isolate nucleic 
acid molecules encoding enzymes such as the GDP-D-mannose:GDP-I/-gaIactose 
epimcrase and related cpimerases, one can use any of a variety of standard molecular and 
biochemical techniques. For example, oligonucleotide primers, preferably degenerate 
5 primers, can be designed using the most conserved regjpns of a GDP-4-keto-6-deoxy-D- 
mannose epimerase/reductase nucleic acid sequence, and such primers can be used in a 
polymerase chain reaction (PGR) protocol to amplify the same or related epimerases, 
including the GDP-T>-mannose:GDP-L-galactose epimerase from the ascorbic acid 
pathway, from nucleic acids (e.g., genomic or cDNA libraries) isolated from a desired 

10 organism (e.g., a microorganism or plant having an L-ascorbic acid pathway). Similarly, 
oligonucleotide probes can be designed using the most conserved regions of a GDP-4- 
keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence and such probe can 
be used to identify and isolate nucleic acid molecules, such as from a genomic or cDNA 
library, that hybridize under conditions of low, moderate, or high stringency with the 

15 probe. 

Alternatively, the GDP-D-mannose:GDP-L-galactose epimerase can be purified 
from an organism such as Protoiheca, the N-terminal amino acid sequence can be 
determined including the sequence of internal peptide fragments), and this information 
can be used to design degenerate primers for amplifying a gene fragment from the 

20 organism cDNA. This fragment would then be used to probe the cDNA library, and 
subsequently fragments that hybridize to the probe would be cloned in that organism or 
another suitable production organism. There is ample precedent for plant enzymes being 
expressed in an active form in bacteria, such as E, coli. Alternatively, yeast are also a 
suitable candidate for developing a heterologous system for L-ascorbic acid production. 

25 As discussed above in general for increasing the action of an enzyme in the L- 

ascorbic acid pathway according to the present invention, in one embodiment of the 
present invention, the action of an epimerase that catalyzes the conversion of GDP-D- 
mannose to GDP-L-galactose is increased by amplification of the expression (i.e., 
overexpression) of such an epimerase. Overexpression of an epimerase can be 

30 accomplished, for example, by introduction of a recombinant nucleic acid molecule 
encoding the epimerase. It is preferred that the gene encoding an epimerase according to 
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the present invention be cloned under control of an artificial promoter. The promoter can 
be any suitable promoter that will provide a level of epimerase expression required to 
maintain a sufficient level of L-ascorbic acid in the production organism. Preferred 
promoters are constitutive (rather than inducible) promoters, since the need for addition 
5 of expensive inducers is therefore obviated. The gene dosage (copy number) of a 
recombinant nucleic acid molecule according to the present invention can be varied 
according to the requirements for maxinium product formation. In one embodiment, the 
recombinant nucleic acid molecule encoding an epimerase according to the present 
invention is integrated into the chromosome of the microorganism. 

10 It is another embodiment of the present invention to provide a microorganism 

having one or more epimerases according to the present invention with improved aflSnity 
for its substrate. An epimerase with improved affinity for its substrate can be produced 
by any suitable method of genetic modification or protein engineering. For example, 
computer-based protein engineering can be used to design an epimerase protein with 

15 greater stability and better affinity for its substrate. See for example, Maulik et al., 1997, 
Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., 
which is incorporated herein by reference in its entirety. 

As noted above, in the method for production of L-ascorbic acid of the present 
invention, a microorganism having a genetically modified L-ascorbic acid production 

20 pathway is cultured in a fermentation medium for production of L-ascorbic acid. An 
appropriate, or effective, fermentation medium refers to any medium in which a genetically 
modified microorganism of the present invention, when cultured, is capable of producing 
L-ascorbic acid. Such a medium is typically an aqueous medium comprising assimilable 
carbon, nitrogen and phosphate sources. Such a medium can also include appropriate 

25 salts, minerals, metals and other nutrients. One advantage of genetically modifying a 
microorganism as described herein is that although such genetic modifications can 
significantly alter the production of L-ascorbic acid, they can be designed such that they 
do not create any nutritional requirements for the production organism. Thus, a minimal- 
salts medium containing glucose as the sole carbon source can be used as the fermentation 

30 medium. The use of a minimal-salts-glucose medium for the L-ascorbic acid fermentation 
will also fiicilitate recovery and purification of the L-ascorbic acid product. 
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In one mode of operation of the present invention, the carbon source 
concentration, such as the glucose concentration, of the fermentation medium is 
monitored during fermentation. Glucose concentration of the fermentation medium can 
be monitored using known techniques, such as, for example, use of the glucose oxidase 
5 enzyme test or high pressure liquid chromatography, which can be used to monitor 
glucose concentration in the supernatant, e.g., a cell-free component of the fermentation 
medium. As stated previously, the carbon source concentration should be kept below the 
level at which cell growth inhibition occurs. Although such concentration may vary from 
organism to organism, for glucose as a carbon source, cell growth inhibition occurs at 

10 glucose concentrations greater than at about 60 g/L, and can be determined readily by 
trial. Accordingly, vdien glucose is used as a carbon source the glucose concentration in 
the fermentation medium is maintained in the range of from about 1 g/L to about 100 g/L, 
more preferably in the range of from about 2 g/L to about 50 g/L, and yet more preferably 
in the range of from about 5 g/L to about 20 g/L. Although the carbon source 

15 concentration can be maintained within desired levels by addition of, for example, a 
substantially pure glucose solution, it is preferred to maintain the carbon source 
concentration of the fermentation medium by addition of aliquots of the ori^nal 
fermentation medium. The use of aliquots of the original fermentation medium are 
desirable because the concentrations of other nutrients in the medium (e.g. the nitrogen 

20 and phosphate sources) can be maintained simultaneously. Likewise, the trace metals 
concentrations can be maintained in the fermentation medium by addition of aliquots of 
the trace metals solution. 

In an embodiment of the fermentation process of the present invention, a 
fermentation medium is prepared as described above. This fermentation medium is 

25 inoculated with 

an actively growing culture of genetically modified microorganisms of the present 
invention in an amount suflBcient to produce, after a reasonable growth period, a high cell 
density. Typical inoculation cell densities are within the range of from about 0. 1 g/L to 
about 15 g/L, preferably from about 0.5 g/L to about 10 g/L and more preferably from 

30 about 1 g/L to about 5 g/L, based on the dry weight of the cells. The cells are then grown 
to a cell density in the range of from about 1 0 g/L to about 1 00 g/L preferably from about 
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20 g/L to about 80 g/L, and more preferably from about 50 g/L to about 70 g/L. The 
residence times for the microorganisms to reach the desired ceU densities during 
fermentation are typically less than about 200 hours, preferably less than about 120 hours, 
and more preferably less than about 96 hours. 
5 The microorganisms useful m the method of the present invention can be cultured 

in conventional fermentation modes, which include, but are not limited to, batch, fed- 
batch, and continuous. It is preferred, however, that the fermentation be carried out in 
fed-batch mode. In such a case, during fermentation some of the components of the 
medium are depleted. It is possible to initiate fermentation with relatively high 

10 concentrations of such components so that growth is supported for a period of time before 
additions are required. The preferred ranges of these components are maintained 
throughout the fermentation by making additions as levels are depleted by fermentation. 
Levels of components in the fermentation medium can be monitored by, for example, 
sampling the fermentation medium periodically and assaying for concentrations. 

1 5 Aftematively, once a standard fermentation procedure is developed, additions can be made 
at timed intervals corresponding to known levels at particular times throughout the 
fermentation. As will be recognized by those in the art, the rate of consumption of 
nutrient increases during fermentation as the cell density of the medium increases. 
Moreover, to avoid introduction of foreign microorganisms into the fermentation medium, 

20 addition is performed using aseptic addition methods, as are known in the art. In addition, 
a small amount of anti-foaming agent may be added during the fermentation. 

The present inventors have determined that high levels of magnesium in the 
fermentation medium inhibits the production of L-ascorbic acid due to repression of 
enzymes early in the production pathway, although enzymes late in the pathway (i.e., from 

25 L-galactose to L-ascorbic acid) are not negatively affected (See Examples). Therefore, 
in a preferred embodiment of the method of the present invention, the step of culturing 
is carried out in a fermentation medium that is magnesium (Mg^"*^) limited. Even more 
preferably, the fermentation is magnesium limited during the cell growth phase. 
Preferably, the fermentation medium comprises less than about 0.5 g/L of Mg^* during the 

30 cell growth phase of fermentation, and even more preferably, less than about 0.2 g/L of 
Mg^"", and even more preferably, less than about 0. 1 g/L of Mg^*. 
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The temperature of the fermentation medium can be any temperature suitable for 
growth and ascorbic acid production, and may be modified according to the growth 
requirements of the production microorganism used. For example, prior to inoculation 
of the fermentation medium with an inoculum, the fermentation medium can be brought 
5 to and maintained at a temperature in the range of from about 20**C to about 45**C, 
preferably to a temperature in the range of fi"om about 25**C to about 40^*0, and more 
preferably in the range of firom about 30^*0 to about SS^'C, 

It is a fiirther embodiment of the present invention to supplement and/or control 
other components and parameters of the fermentation medium, as necessary to maintain 

1 0 and/or enhance the production of I^ascorbic add by a production organism. For example, 
in one embodiment, the pH of the fermentation medium is monitored for fluctuations in 
pH. In the fermentation method of the present invention, the pH is preferably maintained 
at a pH of fi-om about pH 6.0 to about pH 8.0, and more preferably, at about pH 7.0. In 
the method of the present invention, if the starting pH of the fermentation medium is pH 

15 7.0, the pH of the fermentation medium is monitored for significant variations firom pH 
7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide. In a 
preferred embodiment of the present invention, genetically modified microorganisms 
useful for production of L-ascorbic acid include acid-tolerant microorganisms. Such 
microorganisms include, for example, microalgae of the genera Prototheca and Chlorella 

20 (See U.S. Patent No. 5,792,63 1, ibid, and U.S. Patent No. 5,900,370, ibid,). 

The production of ascorbic acid by culturing acid-tolerant microorganisms 
provides significant advantages over known ascorbic acid production methods. One such 
advantage is that such organisms are acidophilic, allowing fermentation to be carried out 
under low pH conditions, with the fermentation medium pH typically less than about 6. 

25 Below this pH, extracellular ascorbic acid produced by the microorganism during 
fermentation is relatively stable because the rate of oxidation of ascorbic acid in the 
fermentation medium by oxygen is reduced. Accordingly, high productivity levels can be 
obtained for producing L-ascorbic acid with acid-tolerant microorganisms according to 
the methods of the present invention. In addition, control of the dissolved oxygen content 

30 to very low levels to avoid oxidation of ascorbic acid is unnecessary. Moreover, tiiis 



wo 99/64618 PCTAJS99/1 1576 

45 

advantage allows for the use of continuous recovery methods because extracellular 
medium can be treated to recover the ascorbic add product. 

Thus, the present method can be conducted at low pH when acid-tolerant 
microorganisms are used as production organi^s. The benefit of this process is that at 
5 low pH, extracellular ascorbic acid produced by the organism is degraded at a reduced 
rate than if the fermentation medium was at higher pH. For example, prior to inoculation 
of the fermentation medium with an inoculum, the pH of the fermentation medium can be 
adjusted, and further monitored during fermentation. Typically, the pH of the 
fermentation medium is brought to and maintained below about 6, preferably below 5.5, 

10 and more preferably below about 5. The pH of the fermentation medium can be 
controlled by the addition of ammonia to the fermentation medium. In such cases when 
ammonia is used to control pH, it also conveniently serves as a nitrogen source in the 
fermentation medium. 

The fermentation medium can also be maintained to have a dissolved oxygen 

15 content during the course of fermentation to maintain cdl growth and to maintain cell 
metabolism for I^ascorbic add formation. The oxygen concentration of the fermentation 
medium can be monitored using known methods, such as through the use of an oxygen 
probe electrode. Oxygen can be added to the fennentation medium using methods known 
in the art, for example, through agitation and aeration of the medium by stirring or 

20 shaking. Preferably, the oxygon concentration in the fermentation medium is in the range 
of from about 20% to about 100% of the saturation value of oxygen in the medium based 
upon the solubility of oxygen in the fermentation medium at atmospheric pressure and at 
a temperature in the range of from about 30°C to about 40'*C. Periodic drops in the 
oxygen concentration below this range may occur during fermentation, however, without 

25 adversely affecting the fermentation. 

The genetically modified microorganisms of the present invention are engineered 
to produce significant quantities of extracellular L-ascorbic acid. Extracellular L-ascorbic 
add can be recovered from the fermentation medium using conventional separation and 
purification techniques. For example, the fermentation medium can be filtered or 

30 centrifiiged to remove microorganisms, cell debris and other particulate matter, and L- 
ascoibic add can be recovered from the cell-fiee supemate by conventional methods, such 
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as, for example, ion exchai^e, chromatography, extraction, solvent extraction, membrane 
separation, electrodialysxs, reverse osmosis, distillation, chemical derivatization and 
crystallization. 

One such example of L-ascorbic acid recovery is provided in U.S. Patent No. 
5 4,595,659 by Cayle, incorporated herein in its entirety be reference, which discloses the 
isolation of L-ascorbic add from an aqueous fermentation medium by ion exchange resin 
adsorption and elution, which is followed by decoloration, evaporation and crystallization. 
Further, isolation of the structurally similar isoascorbic acid from fermentation medium 
by a continuous multi-bed extraction system of anion-exchange resins is described by K. 
10 Shimizu, Agr, Biol. Chenu 3 1:346-353 (1967), which is incorporated herein in its entirety 
by reference. 

Intracellular L-ascorbic acid produced in accordance with the present invention 
can also be recovered and used in a variety of applications. For example, cells from the 
nucroorganisms can be lysed and the ascorbic acid which is released can be recovered by 

15 a variety of known techniques. Alternatively, intracellular ascorbic acid can be recovered 
by washing the cells to extract the ascorbic acid, such as through diafiltration. 

Development of a microorganism with enha2K:ed ability to produce L-ascorbic add 
by genedc modification can be accomplished using both classical strain development and 
molecular genetic techniques, and particularly, recombinant technology (genetic 

20 engineering). In general, the strategy for creating a microorganism with enhanced L- 
ascorbic add production is to (1) inactivate or delete at least one, and preferably more 
than one of the competing or inhibitory pathways in which production of L-ascorbic add 
is negativdy aflfected (e.g., inhibited), and more significantly to (2) amplify the L-ascorbic 
acid production pathway by increasing the action of a gene(s) encoding an enzyme(s) 

25 involved in the pathway. 

In one embodiment, the strategy for creating a microorganism with enhanced L- 
ascorbic acid production is to amplify the L-ascorbic acid production pathway by 
increasing the action of GDP-D-mannose:GDP-L-galactose epimerase, as discussed 
above. Such strategy includes genetically modifying the endogenous GDP-D- 

30 mannose:GDP-L-galactose epimerase such that L-ascorbic acid production is increased, 
and/or expressing/overexpressing a recombinant epimerase that catalyzes the conversion 
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of GDP-E>-inannose to GDP-L-galactose, which includes oqn^ession of recombinant GDP- 
D-mannose:GDP-L-galactose epimerase and/or homologues thereof, and of other 
recombinant epimerases such as GDP-4-keto-6-deoxy-D-mannose epimerase reductase 
and epimerases that share structural homology with such epimerase as discussed in detail 
5 above. 

It is to be understood that a production organism can be genetically modijfied by 
recombinant technology in which a nucleic acid molecule encoding a protein involved in 
the L-ascorbic acid production pathway disclosed herein is transformed into a suitable 
host which is a different member of the plant Idngdom from which the nucleic acid 
10 molecule was derived. For example, it is an embodiment of the present invention that a 
recombinant nucleic add molecule encoding a GDP-D-mannose:GDP-L-galactose 
epimerase from a higher plant can be transformed into a microalgal host in order to 
overexpress the epimerase and enhance production of L-ascorbic add in the microalgal 
production organism. 

15 As previously discussed herein, in one embodiment, a genetically modified 

microorganism can be a microoiganism in which nucldc add molecules have been deleted, 
inserted or modified, such as by insertion, deletion, substitution, and/or inversion of 
nucleotides, in sudi a manner that such modifications provide the desired effect within the 
microorganism. A genetically modified microorganism is preferably modified by 

20 recombinant technology, such as by introduction of an isolated nucleic acid molecule into 
a microorganism. For example, a genetically modified microorganism can be transfected 
with a recombinant nucldc add molecule encoding a protein of interest, such as a protein 
for which increased expression is desired. The transfected nucleic acid molecule can 
remain extrachromosomal or can integrate into one or more sites within a chromosome 

25 of the transfected (i.e., recombinant) host cell in such a manner that its ability to be 
expressed is retained. Preferably, once a host cell of the present invention is transfected 
with a nucldc acid molecule, the nucleic acid molecule is integrated into the host cell 
genome. A significant advantage of integration is that the nucleic acid molecule is stably 
maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule 

30 is operatively linked to a transcription control sequence (described below) which can be 
induced to control expression of the nucleic acid molecule. 
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A nucldc acid molecule can be integrated Into the genome of the host ceU either 
by random or targeted integration. Such methods of integration are known in the art. For 
example, BnK coii strain ATCC 47002 contains mutations that confer upon it an inability 
to maintain plasmids which contain a CoIEl origin of replication. When such plasmids are 
S transferred to this strain, selection for genetic markers contained on the plasmid results 
in integration of the plasmid into the chromosome. This strain can be transformed, for 
example, with plasmids containing the gene of interest and a selectable marker flanked 
by the 5'- and 3 '-termini of the K coli lacZ gene. The lacZ sequences target the incoming 
DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus 

10 replaces the intact lacZ gene, which encodes the enzyme P-galactostdase, with a partial 
lacZ gene interrupted by the gene of interest. Successful integrants can be selected for 
p-galactosidase negativity. 

A genetically modified microorganism can also be produced by introducing nucleic 
acid molecules into a recipient cell genome by a method such as by using a transducing 

15 bacteriophage. The use of recombinant technology and transducing bacteriophage 
technology to produce several different genetically modified microorganism of the present 
invention is known in the art. 

According to the present invention, a gene, for example the GDP-D- 
mannose:GDP-L-gal£ictose epimerase gene, includes all nucleic acid sequences related to 

20 a natural epimerase gene such as regulatory regions that control production of the 
epimerase protein encoded by that gene (such as, but not limited to, transcription, 
translation or post-translation control regions) as well as the coding region itself. In 
another embodiment, a gene, for example the GDP-D-mannose:GDP-L-gaiactose 
epimerase gene, can be an allelic variant that includes a similar but not identical sequence 

25 to the nucleic acid sequence encoding a given GDP-D-mannose:GDP-L-galactose 
epimerase gene. An allelic variant of a GDP-D-mannose:GDP-L-galactose epimerase 
gene which has a given nucleic acid sequence is a gene that occurs at essentially the same 
locus (or loci) in the genome as the gene having the given nucleic acid sequence, but 
whidi, due to natural variations caused by, for example, mutation or recombination, has 

30 a similar but not identical sequence. Allelic variants typically encode proteins having 
similar activity to that of the protein encoded by the gene to which they are being 
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compared. Allelic variants can also comprise alterations in the 5* or 3* untranslated 
regions of the gene (e.g., in regulatory control regions). Allelic variants are well known 
to those skilled in the art and would be expected to be found within a given 
microorganism or plant and/or among a group of two or more microorganisms or plants. 
5 In accordance with the present invention, an isolated nucleic acid molecule is a 

nucleic acid molecule that has been removed from its natural milieu (i.e., that has been 
subject to human manipulation). As such, "isolated" does not reflect the extent to which 
the nudeic acid molecule has been purified. An iscdated nucleic acid molecule can include 
DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a 
10 practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid 
molecule can include a portion of a gene, an entire gene, or multiple genes, or portions 
thereof 

An isolated nucleic acid molecule of the present invention can be obtained from 
its natural source either as an entire (i.e., complete) gene or a portion thereof capable of 

1 5 forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be 
produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) 
amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include 
natural nucleic acid molecules and homologues thereof, including, but not limited to, 
natural allelic variants and modified nucleic acid molecules in which nucleotides have been 

20 inserted, deleted, substituted, and/or inverted in such a manner that such modifications 
provide the desired efiect within the microorganism. A structural homologue of a nucleic 
acid sequence has been described in detail above. Preferably, a homologue of a nucleic 
acid sequence encodes a protein which has an amino acid sequence that is suflSciently 
similar to the natural protein amino acid sequence that a nucleic acid sequence encoding 

25 the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic 
acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid 
strand encoding the natural protein anfiino acid sequence). A nucleic acid molecule 
homologue encodes a protein homologue. As used herein, a homologue protein includes 
proteins in which amino acids have been deleted (e.g., a truncated version of the protein, 

30 such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by 
glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation. 



1 



wo 99/6461 8 PCTAJS99/1 1 576 

50 

amidation and/or addition of glycosylphosphatidyl inositol) in such a manner that such 
modifications provide the desired effect on the protein and/or within the microorganism 
(e.g., increased or decreased action of the protein). 

A nucleic add molecule homologue can be produced using a number of methods 
S known to those skilled in the art (see, for example, Sambrook et al., ibid). For example, 
nucleic acid molecules can be modified using a variety of techniques including, but not 
limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site- 
directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, 
restriction enzyme cleavage of a nucleic acid firagment, ligation of nucleic acid Segments, 

10 PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, 
synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture 
of nucleic add molecules and combinations thereof. Nucleic add molecule homologues 
can be selected fi-om a mixture of modified nucleic adds by screening for the fimction of 
the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene. 

1 5 Although the phrase "nuddc add molecule" primarily refers to the physical nucldc 

add molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of 
nucleotides on the nucleic add molecule, the two phrases can be used interchangeably, 
espedally with respect to a nucleic add molecule, or a nucldc acid sequMce, being 
capable of encoding a gene involved in an L-ascorbic add production pathway. 

20 Knowing the nucldc acid sequences of certain nucldc acid molecules of the 

present invention allows one skilled in the art to, for example, (a) make copies of those 
nucldc add molecules and/or (b) obtain nucldc add molecules including at least a portion 
of such nucleic acid molecules (e.g., nucleic acid molecules including fiiU-length genes, 
fiiU-length coding regions, regulatory control sequences, tmncated coding regions). Such 

25 nucldc add molecules can be obtained in a variety of ways including traditional cloning 
techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR 
amplification of appropriate libraries or DNA using oligonudeotide primers. Preferred 
libraries to screen or from which to amplify nucleic acid molecule include bacterial and 
yeast genomic DNA libraries, and in particular, microalgal genomic DNA libraries. 

30 Techniques to clone and amplify genes are disdosed, for example, in Sambrook et al., 
ibid 
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The present invoition includes a recombinant vector, which includes at least one 
isolated nucleic acid molecule of the present invention, inserted into any vector c£q)able 
of delivering the nudeic acid molecule into a host microorganism of the present invention. 
Such a vector can contain nucleic acid sequences that are not naturally found adjacent to 
5 the isolated nucleic acid molecules to be inserted into the vector. The vector can be either 
RNA or DNA and typically is a plasmid. Recon^inant vectors can be used in the cloning, 
sequencing, and/or otherwise manipulating of nucleic acid molecules. One type of 
recombinant vector, referred to herein as a recombinant molecule and described in more 
detail below, can be used in the expression of nucleic acid molecules. Preferred 

10 recombinant vectors are capable of repHcating in a transformed bacterial cells, yeast cells, 
and in particular, in microalgal cells. 

Transformation of a nucleic acid molecule into a cell can be accomplished by any 
method by which a nucleic acid molecule can be inserted into the cell. Transformation 
techniques include, but are not limited to, transfecdon, electroporation, microinjection and 

15 biolistics. 

A recombinant cell is preferably produced by transforming a host cell with one or 
more recombinant molecules, each comprising one or more nucleic acid molecules 
operatively linked to an expression vector containing one or more transcription control 
sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule 

20 into an expression vector in a manner such that the molecule is able to be expressed when 
transformed into a host cell. As used herein, an expression vector is a DNA or RNA 
vector that is capable of transforming a host cell and of effecting expression of a spedfied 
nucleic acid molecule. Preferably, the expression vector is also capable of replicating 
within the host cell. In the present invention, expression vectors are typically plasmids. 

25 Expression vectors of the present invention include any vectors that flmction (i.e., direct 
gene expression) in a yeast host cell, a bacterial host cell, and preferably a microalgal host 
cell. 

Nucleic acid molecules of the present invention can be operatively linked to 
expression vectors containing regulatory sequences such as transcription control 
30 sequences, translation control sequences, origins of replication, and other regulatory 
sequences that are compatible with the recombinant cell and that control the expression 
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of nucleic acid molecules of the present invention. In particular, recombinant molecules 
of the present invention include transcription control sequences. Transcription control 
sequences are sequences which control the initiation, elongation, and termmation of 
transcription. Particularly important transcription control sequences are those which 
5 control transcription initiation, such as promoter, enhancer, operator and repressor 
sequences. Suitable transcription control sequences include any transcription control 
sequence that can function in yeast or bacterial cells or preferably, in microalgal cells. A 
variety of such transcription control sequences are known to those skilled in the art. 

It may be appreciated by one skilled in the art that use of recombinant DNA 

10 technologies can improve expression of transformed nucleic acid molecules by 
manipulating, for example, the number of copies of the nucleic acid molecules within a 
host cell, the cflBciency with which those nucleic acid molecules are transcribed, the 
eflBciency with which the resultant transcripts are translated, and the efiBciency of post- 
translational modifications. Recombinant techniques useful for increasing the expression 

15 of nucleic acid molecules of the present invention include, but are not limited to, 
operatively linking nucleic acid molecules to high-copy number plasmids, integration of 
the nucleic add molecules into the host cell chromosome, addition of vector stability 
sequences to plasmids^ substitutions or modifications of transcription control signals (e.g., 
promoters, operators, enhancers), substitutions or modifications of translational control 

20 signals, modification of nucleic acid molecules of the present invention to correspond to 
the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use 
of control signals that temporally separate recombinant cell growth firom recombinant 
enzyme production during fermentation. Hie activity of an expressed recombinant protein 
of the present invention may be improved by fi^agmenting, modifying, or derivatizing 

25 nucleic acid molecules encoding such a protein. 

The following experimental results are provided for the purposes of illustration and 
are not intended to limit the scope of the invention. 
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EXAMPLES 

Example 1 

The present example describes the elucidation of the pathway from glucose to L- 
ascorbic acid through GDP-D-mannose in plants. 
5 Since the present inventors have previously shown that Protoiheca makes L- 

ascorbic acid (AA) from glucose, it was worthwhile to examine cultures for some of the 
early convemon products of ghicose. In the past, the present inventors had concentrated 
on pathways from glucose to oiganic adds, based on the published pathway of L-ascorbic 
acid synthesis in animals and proposed pathways in plants. The present inventors 

10 demonstrate herein that the pathway from glucose to L-ascorbic add involves not organic 
adds, but rather sugar phosphates and nudeotide diphosphate sugars (NDP-sugars). 

Prior to the present invention, it was known that all cells synthesize 
polysaccharides by first forming NDP-sugars, The sugar moiety is thra incorporated into 
polymer, while the cleaved NDP is recyded. A variety of polysaccharides are known, and 

15 are usually named based on the relative proportions of the sugar residues in the polymers. 
For example, a "galactomaiman** contains mostly galactose, and to a lesser degree, 
mannose residues. The "biopoljmfier" from Protoiheca strains isolated by the present 
inventors was analyzed and found to be 80% D-galactose, 18% rhamnose (D- or L- 
configuration not determined), and 2% L-arabinose. The present inventors provide 

20 evidence herein of how the respective NDP-sugars that make up the Protoiheca 
biopolymer are formed, and what correlations exist between L-ascorbic add synthesis and 
the formation of the NDP-sugar forms of the sugar residues found in the biopolymer. 

The common NDP-sugar UDP-glucose is shown in Fig. 2B. This is formed in 
plants from glucose-l-P by the action of UDP-D-glucose pyrophosphorylase. UDP- 

25 glucose can be epimerized in plants to form UDP-D-galactose, using UDP-D-glucose-4- 
epimerase. UDP-D-galactose can also be formed by phosphorylation of D-galactose by 
galactokinase to form D-galactose-l-P, which can be converted to UDP-D-galactose by 
UDP-D-galactose pyrophosphorylase. These known routes were believed to account for 
the D-galactose in the Protoiheca biopolymer. The UDP-L-arabinose can be formed by 

30 known reactions beginning with the oxidation of UDP-D-glucose to UDP-D-glucuronic 
acid (by UDP-D-glucose dehydrogenase), decarboxylation to UDP-D-xylose, and 
epimerization to UDP-L-arabinose. This accounts for the arabinose residues in the 
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biopolymer. UDP-L-rhamnose is known to be formed from UDP-D-^ucose, thus all 
three of the sugar moieties in the Prototheca biopolymer can be accounted for by a 
pathway through glucose- 1-P and UDP-glucose. Akematively, if the rhamnose in the 
biopolymer is D-rhaninose, it is not formed via UDP-D-glucose, but by oxidation of GDP- 
5 D-mannose (See Fig. 1). 

GDP-D-rhamnose is formed by converting glucose, in turn, to D-ghicose-6-P, D- 
fiuctose-6-P, D-mannose-6-P, D-maraiose-l-P, GDP-D-mannose, and GDP-D-rhamnose. 
It was of interest to the present inventors that this route passes through GDP-D-mannose. 
Exogenous mannose is known to be converted to D-mannose-6-P in plants, and can enter 
0 the path above. D-mannose is converted to L-ascorbic acid hy Prototheca cells cultured 
by the present inventors as well or better than glucose (see Example 4). The mechanism 
of conversion, in Chlorella pyrenoidosa , of GDP-D-mannose to GDP-L-galactose by 
GDP-D-mannose:GDP-L-galactose epimerase, has been known for years (See, Barber, 
1971, Arch. Biochem. Biophys, 147:619-623, incorporated herein by reference in its 
5 entirety). The presem inventors have discovered herein that I^galactose and I^galactonc^ 
y-lactone are rapidly converted to L-ascorbic acid by strains of Prototheca and Chlorella 
pyrenoidosa. Prior to the present invention, it was knovsm that L-galactono-y -lactone is 
converted to L-ascorbic add in several plant systems, but the synthesis steps prior to this 
step were unknown. Based on the published literature and the present experimental 
evidence, the present inventors have determined that the L-ascorbic acid biosynthetic 
pathway in plants passes through GDP-D-mannose and involves sugar phosphates and 
NDP-sugars, The proposed pathway is shown in Fig. 1. Salient points relevant to the 
design and production of genetically modified microorganisms useful in the present 
method include: 

1. The enzymes leading from D-glucose to D-fructose-6-P are well known 
enzymes in the first, uncommitted steps of glycolysis. 

2. The enzymes involved in the conversion of D-fiaictose-6-P to GDP-D- 
mannose have been well characterized in plants, yeast, and bacteria, particularly 
Azotobacter vinelartdii and Pseudomonas aeruginosa, which convert GDP-D-mannose 
to GDP-D-mannuronic acid, which is the precursor for alginate (See for example, Sa- 
Corrda et al., 1987. J, BacterioL 169:3224-3231; Koplin et al., 1992, J. Bacteriol, 
174:191-199; Oesterhek et al., 1996, Plant Science 121: 19-27; Feingold et al., 1980, Ihe 
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Biochemistry of Plants : Vol 3: Caibohydrates, structure and function, P.K. Stampf & E.E. 
Conn, eds.. Academic Press, New York, pp. 101-170; Smith et al., 1992, Afo/. Cell Biol 
12:2924-2930; Boles et al., 1994, Eur, J, Meet 220:83-96; Hashimoto et al., 1997, J. 
BioL Chem. 212: 16308-163 14, all of which are incorporated herein by reference in their 
5 entirety). 

3. Barber (1971, supra, and 1975) identified in Chlorella pyrenoidosa the 
en2yme activities for the conversion of GDP-D-mannose to GDP-L-galactose and L- 

j galactose-l-P. 

4. The present inventors have shown herein the rapid conversion of L- 
10 galactose and L-galactono-y -lactone to L-ascorbic acid by Protoiheca cells. 

5. L-galactono-y-lactone and L-galactonic acid can be interconverted in 
solution by changing the pH of the solution; addition of base shifts the equilibrium to L- 
galactonic acid, while addition of add shifts the equilibrium to the lactone. Cells may have 
an enzymatic means for this conversion in addition to this non-enzymatic route. 

15 6. In plants, GDP-I^fucose is also formed from GDP-D-mannose, presumably 

for incorporation into polysaccharide. Roberts (1971) fed labeled D-mannose to com root 

I 

tips and found the label in polysaccharide, specifically in the residues of D-mannose, L- 
galactose, and L-fiicose. No label was detected in D-glucose, D-galactose, L-arabinose, 
or D-xylose. Protoiheca and C, pyrenoidosa cells have the ability to convert L-fiicose 
20 (6-deoxy-L-galactose) to a dipyridyl-positive product that was shown by HPLC not to be 
L-ascorbic acid. The present inventors believe that it is was the 6-deoxy analog of L- 
ascorbic acid. 



Example 2 

This example shows that m Protoiheca, like other plants (Loewus, F.A. 1988. In: 
25 J. Priess (ed.), The Biochemistry of Plants, 14:85-107. New York, Academic Press) and 
the green microalga Chlorella pyrenoidosa (Renstrom, et al,, 1983. Plant Sci. Lett. 
28:299-305), ascorbic acid (AA) production firom glucose proceeds by a biosynthetic 
pathway that allows retention of the configuration of the carbon skeleton of glucose. 

Cultures of the strain UV77-247 were grown to moderate cell density in shake 
30 flasks with l-*^C-labeled glucose as 10% of the total glucose (40 g/L). Incubation was 
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as per the standard Mg-limhed screen (see Example 3). The culture supemates were 
clarified, deionized to remove salts, lyophilized, and subjected to nudear magnetic 
resonance (nmr) analysis to determine where in the AA molecule the ^^C was located. In 
each case, approximately 85% of the label was found at the C-1 position of AA, with most 
5 of the remaining label at the C-6 position. This strongly indicated that AA is synthesized 
from glucose by a pathway that retains the carbon chain configuration, /. e., C-1 of 
glucose becon>es C-1 of AA This has typically been observed in plants (Loewus, F.A. 
1988. Ascorbic acid and its metabolic products. In: The Biochemistry of Plants, ed. J. 
Priess, 14:85-107. New York, Academic Press). Animals (Mapson, L.W. and F.A. 

10 Isherwood 1956. BiochenL J. 64:151-157; Loewus, F.A. 1960. J. Biol. Chem. 
235(4):937-939) and protists such as Euglena (Shigeoka, S,, et ah, 1979. J. Nutr. Sci. 
Vitaminol. 25:299-307), on the otho" hand, synthesize AA by a pathway that involves the 
inversion of configuration, /. e., C-1 of glucose becomes C-6 of AA Demonstration of 
the inversion/non-inversion nature of the pathway was an important step in determining 

15 the pathway of AA biosynthesis since the two types of pathways require diflferent types 
of enzymatic reactions. The label fbfund at C-6 of AA is thought to be due to metabolism 
of glucose and subsequent gluconeogenesis. The metabolism of glucose in glycolysis 
proceeds through triose-phosphate intermediates. After this, the C-1 and C-6 carbons of 
glucose become biochemically equivalent. Hexose phosphates can be regenerated fi^om 

20 the triose phosphates by gluconeogenesis, which is essentially a reversal of the degradative 
pathway. Consequently, metabolism of C-l-labeled glucose to triose phosphates with 
subsequent gluconeogenesis would result in the formation of hexose phosphate molecules 
labded at either or both C-1 and C-6, If those hexose phosphates were precursors to AA, 
one would expect the AA to be similarly labeled. Consistent with this type of "isotopic 

25 mbdng" is the observation that sucrose obtained fi:'om l-*^C-labeled glucose was labeled 
at positions 1, 6, 1' and 6*. 

Glucose can also be metabolized by the pentose phosphate pathway, the overall 
balanced equation for which is: 



3 Glucose-6-phosphate — ► 2 Fruclose-6-phosphate + Glyceraldehyde-3-phosphate + 3 COj 
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Based on the known biochemistry, it would then be expected that the label at each 
of the carbons in glucose (Table 1 left column) would appear at the positions for the other 
molecules shown, and that these patterns would be reflected in the AA formed from C-2- 
and C-3-labelcd glucose. 

5 TABLE 1 

Predicted Carbon Labeling of Metabolites of Glucose in the Pentose Phosphate Pathway 



Labeled Glucose 
Cart>on 


Posftran of Labeled CartxKi in: 




F6P(1) 


F6P(2) 


G3P 


1 


+ 








2 




1.3 


1 




3 




2 


2,3 




4 




4 


4 


1 


5 




5 


5 


2 


6 




6 


6 


3 



1 5 AA recovered from cultures fed glucose labeled at C-2 or C-3 was also analyzed 

for its labeling patterns (Table 2). 



TABLE 2 

Lat)eling Pattern tn AA after Cells were Fed 2-^^C and 3-^^C-fllucose 



Cart>on 
Position in AA 


Isotopic enhancement after growth on: 


G-2 labeled glucose 


C-3 lat>eled glucose 


1 


1.0 


0.4 


2 


10.0 


0.9 


3 


0.5 


9.9 


4 


0 


2.8 


5 


2.2 


0.2 


6 


0 


0 



The data above again suggest a pathway from glucose to AA that proceeds by 
retention of configuration. As in the experiments with C-1 labeled glucose, approximately 
one-fiflh of the label is present in "mirror image" position to the glucose label (C-5 for C-2 
30 labeled glucose and C-4 for C-3 labeled glucose), indicating levels of gluconeogenesis 
consistent with those previously observed. 

The small, but significant amount of enhancement observed in other positions is 
consistent with flux through the pentose phosphate pathway. As predicted above, carbon 
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flux through this pathway would result in isotopic enhancement at positions 1 and 3 when 
cells were grown on 2-"C glucose and enhancement at position 2 when cells were grown 
on 3-*^ glucose. This is indeed observed. That there is twice as much enhancement at 
C-1 as there is at C-3 after growth on ghicose is also prediaed. These data indicate 
5 a small but measurable amount of carbon flux through the pentose phosphate pathway. 



Example 3 

This example shows the methods for generating, screening and isolating mutants 
of Prototheca with altered AA productivities compared to the starting strain ATCC 
75669. 

10 ATCC No. 75669, identified as Prototheca moriformis RSP1385 (unicellular 

green microalga), was deposited on February 8, 1994, with the American Type Culture 
Collection (ATCC), RocIcviUe, Maryland, 20852, USA, under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure. Initial screening of Prototheca species and strains was reported in 

15 U.S. Patent No. 5,900,370, ibid. Table 3 lists the formulations of the media for growth 
and maintenance of the strains. Glucose for fermentors was supplied as glucose 
monohydrate and calculated on an anhydrous basis. The recipe for the trace metals 
solution is given in Table 4. The standard growth temperature was 35 *C. All organisms 
were cultured axenically. 



20 TABLE 3 

MedJa for Growth and Maintenance of Prototheca Strains 
All quantities are in g/L unless otherwise spedfied 



Irigredient 


Liquid 


Agar 


Standard 


Mg-limrting 


Slants 


Ferrozine 
Plates 


Standard 
Plates 


Potassium phosphate 
monobasic 


1.3 


1.3 


2.0 


0.27 


2.0 


Potassium phosphate dibasic 


3.8 


3.8 


2.0 


1.4 


2.0 


Trisodium citrate dihydrate 


7.7 


7.7 


2.6 


1.3 


2,6 


Magnesium sulfate 
hepta hydrate 


0.40 


0.02 


0.4 


0.01 


0.4 


Ammonium sulfate 


3.7 


3.7 


1,0 


1.0 


1,0 


Trace Metals Solution 


2mL 


2mL 


2mL 


2mL 


2mL 


Ferrous sulfate heptahydrate 


1.5 mg 


4.5 mg 


1.5 mg 




1.5 mg 


Caldum chloride dihydrate 




0.25 
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Ingredient 


Liquid 


Agar 


Standard 


Mg-itmiting 


Slants 


Ferrozine 
Plates 


Standard 
Plates 


Manganous sulfate 
monohydrate 




0.08 








Yeast extract 






2.5 






Agar 






15 


15 
(Noble) 


15 


pH before autodavmg 


7J2 


7J2 


7.2 




7.2 


Autoclave, then add 


Copper sulfate, 
pentahydrate, 100 g/L 








2mL 




40 g/L Ferrozine in 5 mM 
phosphate (pH 7.5 final) 








8.8 mL 




Ferric ammonium sulfate 
dodecahydrate, 40 g/L 








3.8 mL 




50% glucose with 25 mg/L 
thiamine HCI 


40 mL 


60 mL 


10 mL 


10 mL 


10 mL 



15 TABLE 4 

Trace Metals Solution 



Compound 


Molecular Weight 


Cone, of Individ. 
Solutions, g/L 


mL Indiv. Stock per 
liter of Working Stock 


Distilled Water 






823 


Hydrochloric Acid 




Cone. 


20 


Cobalt Chloride hexahydrate 


237,9 


24.0 


6,5 


Boric ackj 


61.8 


38.1 


24 


Zinc sulfate heptahydrate 


287,5 


35.3 


50 


Manganous sulfate 
monohydrate 


169.0 


24.6 


50 


Sodium molybdate dihydrate 


242.0 


23.8 


2.0 


Calcium chloride dihydrate 


147.0 




11.4g 


Vanadyl sulfate dihydrate 


199.0 


10.0 


8.0 


Nickel nitrate hexahydrate 


290.8 


5.0 


8.0 


Sodium selenite 


173.0 


5.0 


8.0 



30 Mutant isolates were generated by treatment with one or more of the following 

agents: nitrous acid (NA); ethyl methane sulfonate (EMS); or ultraviolet light (UV). 
Typically, glucose-depleted cells grown in standard liquid medium were washed and 
resuspended in 25 mM phosphate buflfer, pH 7.2, diluted to approximately 10^ 
colony-forming units per mL (cfii/mL), exposed to the mutagen to achieve about 99% kill, 

35 incubated 4-8 hours in the dark, and spread onto standard agar medium, or agar media 
containing differential agents. 
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Some mutant colonies on standard agar medium were picked randomly and 
subcultured to master plates. Other isolation plates were inverted over chloroform to lyse 
cells on the surface of the colonies and allow them to release AA. Released AA was 
detected by spraying the treated plates with a solution of 2,6-dichrorophenoI-indophenol 
5 (1,25 in 70% EtOH). The ability of AA to reduce this blue redox dye to its colorless 
form is the basis for a standard assay of AA (Omaye, et aL, 1979. Meth, Enzymol. 
62:3-1 1.). Colonies derived from mutagenized cells were saved to master plates for 
fijTther evaluation if their clear halos were significantly larger than the halos typical of the 
other mutants in that group. Other mutagenized ceUs were spread onto plates containing 

10 an AA detection system incorporated directly into the agar. This system is based on the 
ability of AA to reduce ferric iron to ferrous iron. The compound ferrozine 
(3-(2-pyridyl)-5,6- bis(4-phenylsulfonic acid)-l,2,4-triazine) was present in the agar to 
complex with the ferrous iron and give a violet color reaction. The ferrozine agar 
formulation is shown in Table 3. Colonies giving the darkest color reactions were 

15 master-plated. When screening for non-AA-producing strains (blocked mutants), white 
colonies were chosen against a background of relatively dark colonies. 

For primary screening of tube cultures, cells were inoculated from master plates 
into 4 mL of Mg-Kmiting medium in 16 x 125 mm test tubes, and tubes were shaken in a 
slanted position on a rotary shaker at 300 rpm for four days. After both three and four 

20 days of incubation aliquots were removed for AA assay and cell density determination. 
Those for AA assay were centrifiiged at 1500 x g for 5 min and the resulting supemates 
were removed for either colorimetric assay or high pressure liquid chromatography 
(HPLC). Promising isolates were retested in tube culture. Those passing the tube screen 
were tested in shake flasks. 

25 For secondary screening of flask cultures, cells were inoculated into 50 mL of 

standard flask medium in 250 mL baffled shake flasks, and incubated on a rotary shaker 
at 180 rpm until glucose depletion (24-48 hours). A second series of flasks of 
Mg-sufficient standard medium was inoculated from the first set to a cell density of 0. 15 
A^2o> and incubated for 24 hours. A third series of Mg-limiting flask medium was 

30 inoculated finom the second set by a 1/50 dilution and incubated for 96 hours. Flasks were 
sampled for AA analysis and cell density measurements during this time as required. 
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Aliquots for supernatant AA analysis were centrifliged at 5000 x g for 5 min. Aliquots 
for tota] whole broth AA analysis were first extracted for 1 5 min with an equal volume of 
5% trichloroacetic add (TCA) before centrifiigation. Aliquots of the resulting supemates 
were removed for either colorimetric assay or HPLC analysis. 
5 For colorimetric assay of AA, a modification of the method of Omaye, et aL 

(1979. Meth. Enzymol. 62:3-1 1) was used. Twenty-five jiL alicpiots of culture supemates 
were added to wells of 96-well microplates, and 125 fxL of color reagent was added. The 
color reagent consisted of four parts 0.5% aqueous 2,2 -dipyridyl and one part 8.3 mM 
ferric ammonium sulfate in 27 % (v/v) o-phosphoric acid, the two components being 

10 mixed immediately before use. After one hour, the absorbance at 520 nm was read. AA 
concentration was calculated by comparison of the absorbances of AA standards. 

HPLC analysis was based on that of Running, et al,^ (1994). Supemates were 
chromatographed on a Bio-Rad HPX-87H organic acid column (Bio-Rad Laboratories, 
Richmond, CA) with 13 mM nitric acid as solvent, at a flow rate of 0.7 mL/min at room 

15 temperature. Detection was at either 254 nm using a Waters 441 detector (MMIipore 
Corp., Milford, MA), or at 245 nm using a Waters 481 detector. This system can 
distinguish between the L- and D- isomers of AA. 

For dry weight determinations of cell density, 5 mL whole broth samples were 
centrifliged at 5000 x g for 5 min, washed once with distilled water, and the pellet was 

20 washed into a tared aluminum weighing pan. Cells were dried for 8-24 h at 1 05 C. Cell 
weight was calculated by difference. 

Table 5 shows the abilities of various mutants of Protoiheca to synthesize AA. 



TABLES 

AA Synthesizing Ability of Various Prototheca Mutants in Flask Screen 



Strain 


Specific AA Formation, mg AA per L/Cutture Ae2o. 
during Mg-limtted Incubation 


2 Days Incubation 


4 Days Incubation 


ATCC 75669 


22 


35 


EMS13-4 


79 


166 


UV213-1 


0 


0 


UV218-1 


0 


0 


UV244-1 


0 


0 
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Strain 


Specific AA Formation, mg AA per L/Cutture Ae2o. 
during Mg-limlted Incubation 


2 Days Incubation 


4 Days Incubation 


UV244-15 


58 


68 


UV77-247 


56 


83 


UV140-1 


67 


100 


UV164-6 


91 


131 


NA21-14 


27 


78 


UV82-21 


0 


0 


UV1 27-10 


50 


95 


SP2-3 


3 


4 



1 0 The genealogy of these isolates is presented graphically in the "fairuly tree" of Rg. 

3. ATCC No. , identified as Prototheca moriformis EMS13-4 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 

15 for the Purpose of Patent Procedure. ATCC No. , identified as Prototheca 

moriformis UV127-10 (unicellular green microalga), was deposited on May 25, 1999, 
with the American Type Culture Collection (ATCC), 10801 University Boulevard, 
Manassas, VA 201 10, USA, under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. 

20 ATCC No. , identified as Prototheca moriformis SP2-3 (unicellular green 

microalga), was deposited on May 25, 1999, with the American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 20110, USA, under the terms of 
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms 
for the Purpose of Patent Procedure. 

25 Example 4 

The following example shows that both growing and resting cells of Prototheca 
can rapidly convert L-galactose and L-galactono-Y-lactone to AA, and that conversion 
of D-mannose to AA by Prototheca is more rapid than conversion of D-glucose. 

Shake flask cultures of the mutant strain UV77-247 were grown to glucose 
30 depletion in standard liquid medium (Table 3). Cells were washed twice and resuspended 
in complete medium with the glucose substituted by one of the compounds listed below. 
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Cell suspensions were incubated for 24 hours at 35** C with shaking, and the entire 
suspension was extracted with TCA as above and assayed for AA. 

Tables 6-8 show that both growing and resting cells of strain UV77-247 can 
rapidly convert L-galactose and L-galactono-y-lactone to AA. In these experiments, 
D-fnictose and D-galactose were converted to AA at the same rate as D-glucose, 
suggesting that they are metabolized to AA through the same route as D-glucose. None 
of the organic acids suggested in the literature to be intermediates in the biosynthesis of 
AA were converted to AA, including sorbosone, which has been proposed as an 
intermediate by Saito et a/.(1990 Plant Physiol. 94:1496-1500). 



10 



15 



20 



25 



30 



TABLE 6 

Conversion of Compounds by Resting Cells of Strain U\/77-247 



Substrate (50 mM) 


Total AA, mg/L 


AA Relative to No 
Substrate Control 


L-galactose 


965 


623 


L-^alactono-Y-lactone 


918 


476 


D-fructose 


590 


248 


D-glucosone 


569 


247 


D-glucose 


584 


242 


D-galactose 


542 


200 


D-glucose (10 mM) 


388 


46 


D-gluconolactone 


382 


40 


D-gulono-Y-lactone 


366 


24 


D-glucuronate 


364 


22 


L-sortx>sone 


342 


0 


None 


342 


0 


2-keto-D-gluconlc ackJ 


341 


-1 


DHSoascortMC actd (10 mM) 


330 


-12 


D-glucuronolactone 


329 


-13 


D-gluconIc actd 


309 


-33 


D-galacturonic ackl 


297 


-45 


L-fdonate 


296 


-46 



35 



Since strain UV77-247 converted I^galactose and L-galactono-y -lactone to AA 
much more rapidly than it did glucose, it suggests that these compounds are intermediates 
in the AA biosynthetic pathway and that they are "downstream" from glucose. 

The data in Tables 7 and 8 also show that growing and resting cells of UV77-247 
consistently convert D-mannose to AA at a rate greater than that of glucose. 
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TABLE 7 



Conversion of Compounds to AA by Resting Celts of Strain USfn-TAl 



CompK)und 


Ascorbic Add, mg/L 


25.5 h 


30 h 


47 h 


L-galactose 


667 


718 


620 


L-galactono-Y-lactone 


644 


681 


749 


D-glucosone 


465 


462 


354 


D-mannose 


448 


462 


399 


D-fructose 


402 


408 


367 


d-glucose 


395 


404 


351 


D-galactose 


352 


361 


337 


none 


287 


288 


258 



TABLES 

Conversion of Compounds to AA t>y Growing Cells of Strain UV77-247 



Compound 


Ascort)ic Add, nr>g/L Aa^ | AA/Aezo 


25.5 h 


44h 


L-gatactose 


249 


506 


4.5 


112 


D-mannose 


228 


488 


5.6 


87 


L-galactono~Y-lactone 


214 


342 


5.0 


68 


D-glucose 


178 


398 


5.9 


67 


D-fructose 


181 


383 


5.9 


65 


[>giucosone 


176 


362 


5.7 


64 


O-galactose 


185 


380 


5.9 


64 


none 


182 


249 


4.4 


57 


D-gluconic actd (K) 


178 


262 


5.0 


52 


L-klonate (Na) 


182 


232 


4.7 


49 


2-keto-D-glu conic acid 


182 


255 


5.3 


48 


2-deoxy-D-glucose 


181 


227 


4.8 


47 


D-glucuronic ackJ lactone 


165 


218 


5.0 


44 


D-glucuronic acid (Na) 


173 


241 


5.6 


43 


L-g u lono-Y-lactone 


152 


195 


5.0 


39 


L-sorbosone 


178 


160 


4.7 


34 


ID-glucono-6-lactor>e 


130 


190 


5.7 


33 


O-galacturontc acid 


130 


180 


6.0 


30 



These cells converted L-galactose, L-galactono-Y-lactone and D-mannose to AA 
more rapidly than they did glucose, suggesting that mannose exerts its effect in the 
biosynthetic pathway "downstream" from glucose. 
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Example 5 

Using the methods described above, a collection of mutants was assembled. The 
specific AA formation for representative mutants are shown in Table 5. The genealogy 
of these isolates is presented graphically in the "family tree" of Fig. 3. 
5 These isolates were tested for their ability to convert compounds which could be 

converted to AA by stram UV77-247. Testing was done as in Example 4. Results are 
shown in Table 9. 

TABLE 9 

Conversion of Compounds to AA by Resting Celfs 
0 of Mutant Strains of ProtothecG of Varying Abllftios to Synthesize AA 



Strain 


Absolute AA, mg/L 


Buffer 


Glucose 


L-galactose 


L-gal-y-lacL 


Man nose 


Fructose 


EMS1S-4 


53 


97 


191 


173 


139 


ND 


UV127-10 


45 


140 


213 


140 


128 


143 


SP2-3 


19 


19 


204 


146 


24 


27 


NA21-14 


61 


80 


147 


158 


118 


115 


UV82-21 


15 


16 


183 


175 


18 


17 


UV213-1 


16 


15 


170 


135 


17 


16 


UV2ie-1 


16 


18 


136 


176 


19 


21 


UV244-1 


16 


16 


164 


162 


16 


16 


UV244-15 


26 


77 


30 


21 


94 


89 


UV244-16 


28 


64 


53 


53 


53 


66 



ND = Not Determined 



These data suggest that the mutational blocks in those strains which convert 
fructose and mannose to AA poorly are before ("upstream" from) L-galactose and 
L-galactono-y-lactone in the pathway. 

Example ^ 

The following example shows that magnesium inhibits eariy steps in the production 

of AA. 

To address the question of whether magnesium actually inhibits AA synthesis, 
strain NA45-3 (ATCC 209681) was grown in magnesium (Mg)-limited and Mg-sufl5cient 
medium. ATCC No. 209681, identified as Prototheca moriformis NA45-3 (Source: 



wo 99/64618 PCT/US99/1 1 576 

66 

repeated mutagenesis of ATCC No. 75669; Eucaryotic alga. Division Chlorophyta, Class 
Chlorophyceae, Order Chlorococcales), was deposited on March 13, 1998, with the 
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 
201 10, USA, under the terms of the Budapest Treaty on the International Recognition of 
5 the Deposit of Microorganisms for the Purpose of Patent Procedure. Cells from both 
cultures were harvested and resuspended in the ceU-firee supemate from the Mg-limited 
culture, and to half of each cell suspension additional magnesium was added in order to 
bring the level in the suspension to the Mg-aifl5cient level. The four conditions under 
which assays were run were as follows. 



TABLE 10 

Conditions Used to Test the Effect of Magnesium on AA Production 



Condition 


Magnesium concentration. g/L, during: 


Growth 


Assay 


1Mg>1Mg 


0.02 


0.02 


1Mg>10Mg 


0.02 


0J2 


10Mg>1Mg 


0J2 


0.02 


10Mg>10Mg 


0.2 


0.2 



Substrates previously shown to lead to the formation of AA, namely D-glucose, 
D-glucosone, D-fructose. D-galactose, D-mannose, and L-galactono-Y-lactone, were 
added at 20 g/L to the four cell suspensions. Accumulation of AA after 24 hours was 
measured and con^ared to a control in which no substrate was added. The results of this 
study are shown graphically in Fig. 4. 

When cells growing under magnesium-limited conditions were incubated with 
substrates in low-magnesium broth (lMg>lMg condition), all showed significant and 
similar accumulation of AA over the control condition. When the same cells were 
incubated in high magnesium broth (lMg>10Mg condition), the accumulation of AA was 
reduced about 40% for all substrates except D-mannose and L-galactono-y-lactonc, 
suggesting that 1) the rate-limiting step in the conversion of D-glucose, D-glucosone, 
D-fructose, and D-galactose to AA is inhibited by magnesium or 2) magnesium stimulates 
an enzyme which results in the conversion of these compounds to some other 
compound(s), reducing the amount of substrate available for AA synthesis. On the other 
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hand, conversion ofD-mannose and L-galactono-ylactone appeared to be unaffected by 
the presence of magnesium in the resuspension buflFer, indicating that either 1) 
magnesium-inhibited enzymes are not involved in the conversion of these substrates to AA 
or 2) D-mannose and L-galactono-y-lactone enter the pathway far enough downstream 
5 from the point where they can be siphoned oflFby side reactions involving Mg-requiring 
enzymes. 

When cells were grovwi under magnesium-su£Scient conditions, very little AA 
accumulation from any of the D-sugars was observed, regardless of the level of 
magnesium in the resuspension broth. Accumulation of AA from L-galactono-y-lactone, 

10 however, was enhanced over that observed when cells are grown in Mg-limited 
conditions. This suggests that enzymes early in the pathway are repressed under 
Mg-sufl5dent conditions. Thus, the D-substrates all behaved similarly, with the exception 
of the apparent lack of magnesium inhibition of D-mannose conversion to AA. This 
would suggest that D-mannose enters the AA biosynthetic pathway at a point other than 

1 5 the other D-sugars. 

Figs. 2A and 2B represent some of the fates of glucose in plants. The first 
enzymatic step in this scheme which commits carbon to glycolysis is the conversion of 
fructose-6-P to fruaose-l,6-diP by phosphofructokinase (PFK). This reaction is 
essentially irreversible, and leads to the well known TCA cycle and oxidative 

20 phosphorylation, with concomitant ATP and NADH/NADPH generation. PFK has an 
absolute requirement for magnesium. If magnesium is limiting, this reaction could slow 
and eventually stop, blocking the flow of carbon through glycolysis and beyond, and 
would result in cessation of cell division even in the presence of excess glucose. One 
would expect fructose-6-P to accumulate under these conditions, fiieling AA synthesis by 

25 the pathway shown in Figs. 1 and 2. 



Example 7 

The following example shows the correlation in Prolotheca between AA 
production and the activity levels of the enzymes in the AA pathway. 
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Phoaphonriflnnose isomerase (PMT) Assay 

PMI activity was first assayed (See Fig. 1). Ten strains representing a range of 
AA productivities were grown according to the standard protocol to measure 
AA-synthesizing ability. CeUs were harvested 96 hours into magnesium-limited 
5 incubation, washed and resuspended in buffer containing 50 mM Tris/lO mM MgClj, pH 
7.5. The suspended cells were broken in a French press, sptm at 30,000 x g for 30 
minutes, and desalted through Sephadex G-25 (Pharmacia PD-10 cohimns). Reactions 
were carried out in the reverse direction by adding various volumes of extracts to 
solutions of Tris/Mg buffer containing 0.15 U phospho^ucose isomerase (EC 5.3.1.9), 

10 0.5 U ghicose-6-phosphate dehydrogenase (EC 1.1.1 .49), and 1 .0 mM NADP. Reactions 
were initiated by addition of 3 mM (final) mannose-6-phosphate. Final reaction volume 
was 1.0 mL. AH components were dissolved in Tris/Mg buffer. Activities were taken as 
the change in A^min, From these activities was subtracted the activities measured in 
identical reaction mixtures lacking the M-6-P substrate. Specific activities were calculated 

15 by normalizing the activities for protein concentration in the reactions. Protein in the 
original extracts was determined by the method of Bradford, using a kit fi^om Bio-Rad 
Laboratories (Hercules, CA). All enzymes and nucleotides were purchased fi-om Sigma 
Chemical Co. (St. Louis, MO). 



Phosphomannomutase fPMM) Assay 
20 Phosphomannomutase was measured in a similar manner in the same strains, but 

these assay reaction mixtures also contained 0.25 mM glucose- 1,6-diphosphate, 0.5 U 
commercially available PMI, and the reactions were started with the addition of 3 .0 mM 
(final) mannose-1 -phosphate rather than mannose-6-phosphate. 

Phosphofiiicto kinase fPFK;^ Assay 
25 To shed light on the possibility that the enhancement of AA concentration in 

cultures which were limited for magnesium was due to a diversion of carbon from normal 
metabolism by a reduced activity of the first committed step in glycolysis (PFK) the strains 
were also assayed to confirm the presence of this enzyme activity. Cells were cultured, 
washed and broken as above. Extracts were centnfijged at 100,000 x g for 90 min before 
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desalting. Reactions were carried out in the forward direction by adding various volumes 
of extracts to solutions of Tris/Mg buffer containing 1.5 mM dithiothreitol, 0.86 U 
aldolase (EC 4.1.2.13), 14 U a-glycerophosphate dehydrogenase (EC 1.1.1.8), 14 U 
triosephosphate isomerase (EC 5.3. l.l), 0.1 1 mMNADH, and 1.0 mM ATP. Reactions 
5 were initiated by addition of 5 mM (final) fiiictose-6-phosphate. Final reaction volume 
was 1.0 mL. AU components were dissolved in Tris/Mg buflFer. Activities were taken as 
the change in A34o/min. From these activities were subtracted the activities measured in 
identical reaction mixtures lacking the F-6-P substrate. Specific activities were calculated 
by normalizing the activities for protein concentration in the reaction. Protein in the 
10 original extracts was determined as above. 

GDP-D-mannose pyrophosphorvlase fGMP^ Assay 

These same mutant strains were assayed for the next enzyme in the proposed 
pathway, GMP. Strains were grown both according to the standard Mg-Iimiting protocol 
(harvested 43-48 hours into magnesium-limited incubation) and in standard Mg-sufficient 

15 medium (harvesting all cells before glucose depletion). Washed cell pellets were 
resuspended in 50 mM phosphate buffer, pH 7.0, containing 20% (v/v) glycerol and 0. 1 
M sodium chloride (3 mL buffer/g wet cells), and broken in a French press. Crude extracts 
were spun at 15,000 x g for 15 minutes. Reactions were carried out in the forward 
direction by adding various volumes of extracts to solutions of 50 mM phosphate/4 mM 

20 MgCIj buffer, pH 7,0, containing 1 mM OTP. Reactions were initiated by addition of 1 
mM (final) mannose-1 -phosphate. Final reaction volume was 0.1 mL. Reaction mixtures 
were incubated at 30 C for 10 min, filtered through a 0.45 ^m PVDF syringe filter, and 
analyzed for GDP-mannose by HPLC. A Supelcosil SAXl column (4.6 x 250 mm) was 
used with a solvent gradient (1 mL/min) of: A - 6 mM potassium phosphate, pH 3.6; B 

25 - 500 mM potassium phosphate, pH 4.5. The gradient was: 0-3 min, 100% A; 3-10 min, 
79% A; 10-15 min, 29% A Column temperature was 30 C. Two assays that showed 
enzyme activity proportional to the amount of protein were averaged. Control 
no-substrate and no-extract reactions were also run. Specific activity was calculated by 
normalizing the activity for protein concentration in the reaction. Protein in the original 

30 extracts was determined as above. 
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GDP-D-TnannQse:GDP-r>-ga1actose Epimera<ie Assay 

Further tests measured the activities of the next enzyme in the proposed pathway, 
GDP-D-mannose:GDP-L-gaIactose epimerase. Strains were grown according to the 
standard protocol, harvested 43-48 hours into magnesiunvJimited incubation, washed, and 
5 resuspended in buflfer containing 50 mM MOPS/5 mM EDTA, pH 7,2. Washed pellets 
were broken in a French press, and ^un at 20,000 x g for 20 min. Protein determinations 
were made as above and a dilution series of each was made, ranging from 0.4 to 2.2 mg 
protein/mL. 50 jxL aliquots of theise dilutions were added to 10 ^L aliquots of 6.3 mM 
GDP-D-mannose in which a portion of this substrate was universally labeled with in 

10 the mannose moiety. This substrate had an activity of 16 ^Ci/mL before dilution into the 
reaction mixture. Reactions were stopped after 10 min by transferring 20 ^L of the 
mixture into microfuge tubes containing 20 of 250 mM trifluoroacetic acid (TFA) 
containing 1.0 each D-mannose and L-galactose. These tubes were sealed and boiled 
for 10 min, cooled, spun for 60 sec in a Beckman Microfuge E, and 5 |iL of each 

1 5 hydrolysate was spotted on 20 x 20 cm plastic-backed EM Science Silica gel 60 thin-layer 
chromatogn^hy plates (#5748/7), with 1 cm lanes created by scoring with a blunt stylus. 
After drying, plates were twice chromatographed for 2.5 hours in ethyl 
acetate:isopropanol:water, 65:22.3 : 12.7 (plates were dried between runs). Spots of free 
sugars were visualized by spraying dried plates with 0.5% p-anisaldehyde in a 62% 

20 ethanolic solution of 0.89 M sulfiiric acid and 0.17 mM glacial acetic acid, and heating at 
105 C for about 15 min. Spots of L-galactose and D-mannose were cut from the plates 
and counted in a scintillation counter (Beckman model 2800). For time-zero control 
counts, 16.7 fiL of each extract dilution was added to 23.3 |iL of the labeled substrate 
above, which had been diluted 1:7 with the TFA/mannose/galactose solution. 

25 Table 1 1 summarizes the results of the five enzyme assays for the strains tested, 

along with their specific AA formations. 
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TABLE 11 

Specific Enzyme Activities (mU)* of Selected Mutant ProtothecQ Strains 



Strain 


AA Specific 
Form, mg/g 


PMI 


PMM 


PFK 


GMP 


Epimerase 


Mg- 
Itmited 


Mg- 
suffictent 


1 nf'ttiA a. 














0.79 


ClJIOi-a A 
CIVIO 1 




in A 

lU.O 


AO R 
O9.D 


10. 3 


2.6 


6.8 


0.78 
















f\ TO 


NA45-3 


61.4 












0.58 


UV77-247 


44.4 












0.52 


UV1 27-10 


40.1 


11.1 


45.8 


24.4 


4.3 


5.9 


0.39 


UV244-15 


24.5 


14.3 


41.5 




3.1 


5.3 


0.42 


NA21.14 


23.6 


12.1 


60,3 


47.4 


2.4 


7.6 


0.27 


ATCC 75669 


21.9 












0.28 


UV244-16 


5.0 


16.5 


85.6 




4.3 


5J2 




SP2-3 


2.0 


17.7 


47.0 


64.5 


2.0 


7.5 


0.03 


UV216-1 


0.4 


15.9 


72.1 




2.7 


7.0 


0.83 


UV213-1 


0.1 


19.7 


47.7 


32.6 


3.2 


6.7 


0.60 


UV82-21 


0.0 


14.6 


70.6 


30.4 


4.1 


7.5 


0.15 


UV244-1 


0.0 


18.2 


51.1 




5.5 


12 


0.15 



Units: PMI and PMM. nmoles NADP reduced per min/mg protein; PFK, nmoles NADH oxidized per 
20 mlnAng protein; GMP, nmoles GDP-D-mannose formed per minAng protein; epimerase, nmolee 

GDP-L-galactose formed per min/mg protan. 

The only enzyme which showed a strong correlation between activity and the 
ability to synthesize AA was the GDP-D-mannose;GDP-I^galactose epimerase. This 
correlation is depicted in Fig. 5. All of the strains which produced mesisurable gunounts 

25 of AA had measurable amounts of epimerase activity. The converse was not true: four 
of the strains which synthesize little or no AA had significant epimerase activities. These 
strains are candidates for having mutations which aflFect enzymatic steps downstream from 
the epimerase. Since all of the strains tested can synthesize AA from L-galactose and 
L-galactono-y-lactone (see Examples 4 and 5), the genetic lesion(s) in these four mutants 

30 must lie between GDP-L-galactose and free L-galactose, 
Example 8 

The next example shows the relationship between GDP-D-mannose: GDP-L- 
galactose epimerase activity and the degree of magnesium limitation in two strains, the 
original unmutagenized parent strain ATCC 75669, and one of the best AA producers, 
35 EMS13-4 (ATCC ). 
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Four flasks of each strain were grown according to the standard protocol. One 
culture of each was harvested 24 hours into magnesium-limited incubation, and every 24 
hours thereafter for a total of four days. One flask of each strain was also harvested 24 
hours into magnesium su£5cient incubation. All cultures had glucose remaining when 
5 harvested. Fig. 6 shows graphically the AA productivity and epimerase activity in 
EMS 13-4 and ATCC 75669 as the cultures became Mg-limited. Epimerase activity in 
EMS 13-4 was significantly greater than that in ATCC 75669 at all time points. There was 
also a concurrent rapid rise in both AA productivity and epimerase activity in EMS 13-4 
as the cultures became increasingly Mg-limited. While there was a moderate increase in 
10 AA productivity in ATCC 75669 as Mg became more limiting, there was no effect on 
epimerase activity. 
Example 9 

The following example shows the results of epimerase assays performed with 
extracts of two K coli strains into vs^ch were cloned the E. coli gene for GDP-4-keto-6- 
15 deoxy-D-mannose epimerase/reductase. 

The E, coli K12 wca gene cluster is responsible for cholanic add production; 
wcaG encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. 

The E. coli wcaG sequence (nucleotides 4 through 966 of SBQ ID NO:3) was 
amplified by PCR from E. coli W3110 genomic DNA using primers WG EcoRI 5 (5' 
20 TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG 3'; SEQ ID NO: 12) and WG 
Xhol 3 (5* AACTCGAGTTACCCCCAAAGCGGTCTTGATTC 3'; SEQ ID NO:13). 
The 973-bp PCR product was ligated into the vector pPCR-Script SK(+) (Stratagene, 
LaJolla, CA). The 973-bp ExoRII/XhoI fragment was moved from this plasmid into the 
ExoKH/XhoI sites of pGEX-5X-l (Amersham Pharmacia Biotech, Piscataway, NJ), 
25 creating plasmid pSW67-l . Plasmid pGEX-5X-l is a GST gene fusion vector which adds 
a 26-kDa GST moiety onto the N-terminal end of the protein of interest, E. coli 
BL21(DE3) was transformed with pSW67-l and pGEX-5X-l, resulting in strains 
BL21(DE3)/pSW67-l and BL21(DE3)/pGEX-5X-l. 

The£. coli wcaG sequence (nucleotides 1 through 966 of SEQ ID NO:3) was also 
30 amplified by PCR from E. coli W3 1 10 genomic DNA using primers WG EcoRI 5-2 (5' 
CTGGAGTCGAATTCATGAGTAAACAACGAG 3'; SEQ ID NO: 14) and WG PstI 3 
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(5* AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC 3'; SEQ ID NO: 15), The 976- 
bp PGR product was ligated into a pPCR-Script (Stratagene). The 976-bp ExoRII/PstI 
fragment was moved from this plasmid into the ExoRII/PstI sites of expression vector 
pKK223-3 (Amersham Pharmacia Biotech), creating plasmid pSW75-2, K coU JM105 
5 was transfomied with pKK223-3 and pSW75-2, resulting in strains JM105/pKK223-3 and 
JM105/pSW75-2. 

All six strains were grown in duplicate at 37*'C with shaking in 2X YTA medium 
until an optical density of 0.8-1 .0 at 600 nm was reached (about three hours). 2X YTA 
contains 16 g/L tryptone, 10 g/L yeast extract, 5 g/L sodium chloride and 100 mg/L 

10 ampicillin. One of each culture was induced by adding isopropyl P-D- 
thiogalactopyranoside (IPTG) to 1 mM final concentration. All 12 cultures were 
incubated for an additional four hours, washed in 0.9% NaCl, and the cells were frozen 
at -80**C. Prior to pelleting the cells for preparation of extracts, a portion of each culture 
was used for a plasmid DNA miniprep to confirm the presence of the appropriate plasmids 

15 in these strains. A protein preparation of each culture was also nm on SDS gels to 
confirm expression of a protein of the appropriate size where expected. Frozen pellets 
were thawed, resuspended in 2.5 mL MOPS/EDTA buflfer, pH 7.2, broken in a French 
Press (10,000 pa), spun for 20 min at 20,000 x g, assayed for protein as above and diluted 
to 0.01, 0.1, 1.0 and 3 mg/mL protein. 

20 Induction of the strain BL2 1 (DE3)/pGEX-5X- 1 resulted in high-level expression 

of a 26-kDa protein indicating the synthesis of the native GST protein. Induction of strain 
BL21(DE3)/pSW67-l resulted in high-level expression of a 62-kDa protein, indicating the 
synthesis of the native GST protein (26K) fiised to the wcoG gene product (36K). An 
aliquot of the fijsion protein was treated with the protease Factor Xa (New England 

25 Biolabs, Beverly, MA), which cleaves near the GST/wcaG junction. Induction of the 
strain JM105/pSW75-2 resulted in high level expression of a 36-kDa protein, indicating 
the synthesis of the wcaG gene product. No such protein was detected in 
JM105/pKK223-3 (vector only). 

Next, it was of interest to test extracts in the standard epimerase assay described 

30 in Example 7 to determine if any of the extracts containing the wcaG product could bring 
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about the conversion of GDP-D-mannose to GDP-L-galactose. The extracts to be 

assayed are: 

BL21(T)E3;^ Group 

1. BL21(DE3) uninduced 

5 2. BL21(DE3) induced with ImM IPTG 

3. BL21(DE3)/pGEX-5X-l uninduced 

4. BL21(DE3)/pGEX-5X-l induced with ImM IPTG 

5 . BL2 1 (DE3)/pSW67-l uninduced 

6. BL21^E3ypSW67-I induced with 1 mM IPTG; fusion protein intact 

10 7. BL21(DE3)/pSW67-l induced with 1 mM IPTG; GST moiety cleaved 

JM105 Group 

1. JM105 uninduced 

2. JM105 induced with ImM IPTG 

3. JM105/pKK223-3 uninduced 

15 4. JM105/pKK223-3 induced with 1 mM IPTG 

5. JM105/pSW75-2 uninduced 

6. JM105/pSW75-2 induced with 1 mM IPTG 

Extracts 1 and 7 from the BL21(DE3) group and extracts 1 and 6 from the JM105 
group were tested for GDP-D-mannose:C35P-L-galactose epiraerase-Hke activity in a pilot 

20 experiment. In this initial experiment, no epimerase activity was detected in any of the 
extracts. At this time, such a result can be attributed to a number of possibilities. First, 
it is possible that the wcaG gene product is incapable of cataly2dng the conversion of 
GDP-D-mannose to GDP-L-galactose, although this conclusion can not be reached until 
several other parameters are tested. Second, it is possible that under the assay conditions 

25 which are satis&ctory to measure activity for the endogenous GDP-D-mannose:GDP-L- 
galactose epimerase, the wcaG gene product does not have GDP-D-mannose:GDP-L- 
galactose epimerase-like activity. Therefore, alternate conditions should be tested. 
Additionally, confirmation experiments should be performed to confirm the accuracy of 
the pilot conditions. Third, although the BL21(DE3) and the JM105 clones produce 

30 proteins of the expected size, the constructs have not been sequenced to confirm the 
proper coding sequence for the wcaG gene product and thereby rule out PCR or cloning 
errors which may render the wcaG gene product inactive. Fourth, the protein formed 
Srom the cloned sequence is flill-length, but inactive, for example, due to incorrect tertiary 
stnicture (folding). Fifth, the gene is overexpressed, resulting in accumulation of insoluble 

35 and inactive protein products (inclusion bodies). Future experiments will attempt to 



wo 99/64618 



75 



PCTAJS99/11576 



determine whether the constructs have or can be induced to have the ability to catalyze 
the conversion of GDP-D-mannose to GDP-L^galactose, and to use the sequences to 
isolate the endogenous GDP-D-mannose:GDP-l^gaIactose epimerase. 

Table 12 provides the atomic coordinates for Brookhaven Protein Data Bank 
5 Accession Code 1 bws: 



TABLE 12 





HEADER 


EPIMERASE /REDUCTASE 27-SEP-98 IBWS 




XZTIrfi 


CRYSTAL STRUCTURE OF GDP-4-KETCK6-DEOXY-D-MANWOSE 




TITLE 


2 EPIMERASE/REDUCTASE FROM ESCHERICHIA COLI A KEY ENZYME IN 


10 


TITLE 


3 THE BIOSYNTHESIS OF GDP-L-FOCOSE 




COMPND 


MOL ID: 1; 




CPMPNP 


2 MOLECULE: GDP-4-KETO-6-DEOXY-D-MAHNOSB EPIMERASE/REDUCTASE; 




COMPND 


3 CHAIN: A: 




C«1PNI? 


4 ENGINEERED: YES; 


15 




5 BIOLOGICAL UNIT: H(»40DIMER 






MOL ID: 1; 




5OTRCB 


2 ORGANISM SCIENTIFIC: ESCHERICHIA COLI; 




SOURCE 


3 EXPRESSION SYSTEM: ESCHERICHIA COLI 




KB WPP 


EPIMERASE/REDUCTASE. GDP-L-FUCOSE BIOSYNTHESIS 


20 


EXPDTA 


X-RAY DIFFRACTION 






DE M.RI2ZITONETTIFLORA 




REVDAT 


1 13-JAN-99 IBWS 0 






AUTH DE D.RIZZITONETTIVIGBVANISTURLABISSOFLORA 




7mh 


TITL GDP-4-KETO-6-DEOXYD-MANNOSE EPIMERASE/REDUCTASE 


25 


JRNT, 


TTTL 2 FROM ESCHKRTCHTA COLT. A KEY ENZYME IN THE 




TONL 


TITL 3 BIOSYNTHESIS OF GDP-L-FUCOSE, DISPLAYS THE 




JRNL 


TITL A STRUCTURAL CHARACTERISTICS OF THE RED PROTEIN 




JRNL 


TTTL 5 HOMOLOGY SUPERFAMTLY 




JRNL 


REF STRUCTURE fLONDON) 1998 


30 


^RNIi 


REFN 9999 




RgMARK 


1 






2 






2 RESOLUTION. 2.2 ANGSTRCMS . 




REMARK 


3 


35 




3 REFINEMENT, 






3 PROGRAM : TNT 






3 AUTHORS : TRONRUD.TBN EYCK . MATTHEWS 




REMARK 


3 






3 DATA USED IN REFINEMENT. 


40 


REMARK 


3 RESOLUTION RANGE HIGH (ANGSTROMSl : 2.2 






3 RESOLUTION RANGE LOW (ANGSTROMS) : 15,0 
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REMARK 


3 


PAIA OTTOrr (SIWIA (D ) 


: 0.0 










2 


COMPLETENESS FOR RANGE 


: 99.7 








REMARK 


3 


NUMBER OF RKFT.ECTrONS 


: 24481 










3 










5 




3 


USING DATA ABOVE SIQIA CUTOFF. 












3 


CROSS-VALIDATION METHOD 


: NONE 




REMARK 


3 


FREE R VALUE TEST SET SELECTION 


: NULL 






3 


B^jyALUB (WORKING + TEST SET^ 


: NULL 




REMARK 


3 


B.JVALTO (WORKING SET> 


: NONE 


10 




3 


FREE R VALUE 


: NULL 






3 


FREE R VALUE JEST SET SIZE (%1 


: NONE 






3 


FREE R VALUE TEST SET COUNT 


: NULL 






3 














3 


USING ALL DATA. NO SIGMA CUTOFF. 








15 




3 


R VALUE (WORKING + TR55T SET. NO 


CUTOFF) 


• NULL 






3 


R VALUE (WORKING SET. NO 


CUTOFF! 


0.202 






3 


FREE R VALUE (no 


CUTOFF) 


0.2B7 




REMARK 


3 


FREE R VALUE TEST SET SIZE f%, NO 


CUTOFF! 






REMARK 


3 


FREE R VALUE TEST SET COUNT (NO 


CUTOFF) 


NULL 


20 




3 


.....TOTAL NUMBER OF REFLECTIONS (NO 


cuTPfF) • 


NULL 




REMARK 


3 










REMARK 


3 


NUMBER OF NON-HYDROGEN ATOM.*? nSPin TM RRT^TMWhrmT 






REMARK 


3 


PROTEIN ATMS : 2527 












3 


NUCLEIC ACID ATCM3 : NULL 








25 




3 


OTTHER ATOMS : 109 










REMARK 


3 












REMTVRK 


3 


WILSON B VALUE (FROM FCAI.C, A**Z) : 










REMARK 


3 












REMARK 


3 


RMS DEVIATIONS FROM IDEAL VALUES. 


RMS 


WEIGHT 


COUNT 


30 




3 


BOND LENGTHS (A) : 


0.016 ! 


NULL ; 


NULL 




REMARK 


3 


_BOND ANGLES f DEGREES > 




NVLIf ; 






REMARK 


3 


TORSION ANGLES (DEGREES) : 


NULL / 


NULL ; 


NULL 




REMARK 


3 


PSEUDOROTATION ANGLES (DEGREES i r 


NULL ; 


NULL ; 


NULL 




REMARK 


3 


TRIGONAL CARBON PLANES (A) r 


NULL ; 


NVLIf ; 


NULL 


35 




3 


GENERAL PLANES (A) : 


NULL ; 


NULL ; 


NULL 




REMARK 


3 


.. ISOTROPIC THERMAL FACTORS (A**7) z 


NULL ; 


WLL ; 


NVLIf 






—1. 


NON-BONDED CONTACTS (A) : 


NULL ; 


NULL ; 


NULL 




REMARK 


3 












REMARK 


3 


INCORRECT CHIRAL-CENTERS fCOUNTl : 


NULL 






40 


REM7SJW 


3 












REMARK 


3 


BULK SOLVENT MODELING. 










REMARK 


3 


METHOD USED : NULL 










REMARK 


3 


KSOL : NULL 










REMARK 


3 


BSOL : Nm,L 








45 


REMARK 


3 
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10 



15 



20 



25 



30 





3 


RESTRAINT LIBRARIES. 






3 


_ STEREOCHQIISTRY : NULL 






3 


ISOTROPIC THERMAL FACTOR RESTRAINTS • NITLT. 












3. 


OTHER REFINEMENT REMARKS: NULL 




REMARK 












1BW3 CCMPLTES WITH FORMAT V. 2.2. 


16-DEC-1996 












5 


WARNING 






5 


isw;?; xaz? J? iayer i eslsasb. 




REMARK 


5 






REMARK 


5 


FLBA5B W9TB THIS ENTRY RBLfift^EP AP^W PRPOSJTPR 




? 


?HB*?KIN<? AW APP^QVhL POT fflXHQVT PPB SXAffT INTBRVEifriQN , 


REMARK 




AN AUXILIARY FILE, AUX1BWS,RPT. IS AVAILABLE FRCM THE 






PDB FTP SERVER AND IS ACCESSIBLE 


THROUGH THE 3DB BROWSER. 






THE _FILE CONTAINS THE OUTPUT OF THE PROGRAM WHAT CHECK AND _ 






OTHER DIAGNOSTICS. 




BEM^SiL. 


5 










NOMENCLATURE IN THIS ENTRY, INCLUDING HET RESIDUE NAMES 




5 


AND HET ATOM NAMES, HAS NOT BEEN 


STANDARDIZED BY THE PDB 




5 


PROCESSING STAFF. A LAYER 2 ENTRY WILL BE RELEASED SHORTLY 




5 


T^FTER THIS STANDARDIZATION IS COMPLETED AND APPROVED BY THE 




5 


DEPOSITOR. THE LAYER 2 ENTRY WILL BE TREATED AS A 




5 


CORRECTION TO THIS ONE. WITH THE 


APPROPRIATE REVDAT RECORD. 




5 








5 


FURTHER INFORMATION INCLUDING VALIDATION CRITERIA USED IN 






CHECKING THIS ENTRY AND A LIST OF MANDATORY DATA FIELDS 




5 


ARE AVAILABLE FROM THE PDB WEB SITE AT 




5 


HTTP ; //^WWff . PPP . BNL . GQY/ . 




REMARK 


200 








200 


EXPERIMBNTAi PETAILS 




RSWARK 


200 


EXPERIMENT TYPE 


X-RAY DIFFRACTION 


REMARK 


200 


DATE OF DATA COLLECTION 


AUG-1997 




200 


XEMPBRATTOP fKEIiYIW) 


120 




200 


PH 


6.5 




200 


NUMBER OF CRYSTALS USED 


1 




200 








200 


5YW<;WOiRON (x/JN) 


N 




200 


RADIATION 5PVRCE 


WNE 


BEMARK 


200 


EEAmiIHE 


NULL 




200 


X-RAY GENERATOR MODEL 


: RIGAKU RU200 


REMARK 


200 


MONOCHRCMATIC OR LAUE (M/Ll : 


M 


REM7VRK 


200 


WAVELENGTH OR RANGE (A) 


I. 5416 


R£»AiU^ 


200 


MQNOCHRCWATOR 


: NUIiI. 



35 



40 



45 



REMARK 200 OPTICS 



NULL 
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200 








REMARK 


200 


DETECTOR TYPE 


: IMAGE PLATE 






200 


DETECTOR MANUFACTURER 


: RAXIS 




REMARK 


200 


TNTENSITY-TNTEGRATION SOFTWARR 


: MOSFLM 


5 




200 


DATA SCALING SOFTWARE 


: SCALA 






200 










200 


NUMBER OF UNIOUE REFLECTIONS 


! 24481 






200 


RESOLUTION RANGE HIGH fA^ 


: 2.2 






200 


RESOLUTION RANGE LOW fA^ 


: 15.0 


10 




200 


REJECTION CRITERIA rsiGMAfin 


: NONE 






200 










200 


OVERALL. 








200 


CCMPLETENESS FOR RANGE fft^ 


: 99.7 






200 


DATA REDUNDANCY 


: 4.3 


15 


REMARK 


200 


R MERGE fT) 


: 0.057 






200 


R SYM (r\ 


: NONE 






20O 


<I/SIGMA(I)>. FOR THE DATA SET 


: 13.6 






200 










290 


IN THE HIGHEST RESOLUTION SHELL. 




20 




290 


HIGHEST RESOLUTION SHELL, RANGE 


HIGH fAl : NULL 






200 


HIGHEST RESOLUTION SHELL. RANGE 


LOW fA> : NULL 






200 


CCMPLETENESS FOR SHELL fft^ 


: NULL 






299 


DATA REDUNDANCY IN SHELL 


: NULL 






200 


R MERGE FOR SHELL (1\ 


: NULL 


25 




290 


R SYM FOR SHELL (11 


: NULL 




REMARK 


290 


<T/STMV(T^> FOR SHELL 


: NULL 






200 










200 


DIFFRACTION PROTOCOL: NULL 








200 


METHOD USED TO DETERMINE THE STRUCTURE: MTR 


30 




290 


SOFTWARE USED: NULL 






REMARK 


299 


STARTING MODEL: NULL 








290 










200 


REMARK: NULL 








290 






35 




290 


CRYSTAL 








290 


SOLVENT CONTENT. VS (%) : NULL 








280 


MATTHEWS COEFFICIENT, VM f ANn.qTPnM?;**^ /HAl • KTTTT.T. 




REM?UIK 


280 








REMARK 


290 


CRYSTALLIZATION CONDITIONS: NULL 




40 




2?0 








REMARK 


2?0 


CRYSTALLOGRAPHIC SYMMETRY 








290 


SYMMETRY OPERATORS FOR SPACE GROUP: P 32 2 1 






290 










290 


SYMOP SYMMETRY 




45 




290 


NNNMMM OPERATOR 
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290 


1555 




X. Y. Z 












2?0 


2555 




-Y.X-Y,Z+2/3 










290 






Y-X--X,Z+l/3 










2?0 


4555 




Y.X,-Z 








5 




Z9Q 


5555 






-z 








RPlftfiK 








-X.Y-X.2/3 


-z 






























WHERE NNN 


r -> OPERATOR NUMBER 










2?9 


MMM -> TRANSIATTOM VTVTTnn 




10 




290 


















290 


CRYSTAILOGRAPHTC SYMMRTRY TPaMqpnTmXTTnM*; 








29Q 


THE POLLOWTNG 


TRANSFORMATIONS OPERATE ON THE ATOM /HPIT ATM 






2?Q 


RECORDS IN THIS ENTRY TO PRODUCE 


CRYSTALLOGRAPHI CALL Y 






2?0 


RELATED MOLECUI.ES . 








15 




2?0 


SMTRVl 


1 


1.000000 


P. DUDDOQ 


0 . 000000 


0.00000 






2?0 


SMTRY2 


1 


0.000000 


1.000000 


0.000000 


0.00000 






2?0 


SMTRY3 


I 


o.OQPOOO 


0.000000 


1.000000 


0.00000 






290 


SMTRYl 


2 


-0.500045 - 


-0.865974 


0.000000 


0 . 00000 




REM7VRK 


290 


SMTRY2 


2 


0.866077 - 




0.000000 


0.00000 


20 




2?0 


SMTRY3 


2 


0.000000 


0.000000 


1.000000 


50 . 56553 






2?9 


SMTRYl 


3 


-0.499955 


o.?$597^ 


0,000000 


0,00000 






2?g 


SMTRY2 


3 


-0.9§§977 - 


•0,500015 


0.000000 


0. 00000 






2?P 


SMTRY3 


:? 


0.000000 


0.000000 


1.000000 


25.29276 






290 


5MXRX1 


1 


-0.500045 


0.9§5922_ 


0.000000 


0. 00000 


25 




290 


?m'RY2 




0.866077 


P-5Q0015 


0.000000 


0.00000 






290 


SMTRY3 




0.000000 


0.000000 


-1.000000 


0.00000 




REMARK 


290 


SMTRYl 


5 


1.000000 


0.000104 


0.000000 


0.00000 






290 


SMTRY2 


5 


0.000000 - 


■1.000000 


0.000000 


0.00000 




REMARK 


290 


SMTRY3 


5 


0.000000 


0.000000 


-1.000000 


25.29276 


30 


RBM?iRK 


290 


SMTRYl 


^ - 


-0.499955 - 


0,666026 


0.000000 


0.00000 




REMARK 


290 


SMTRY2 


6 - 


-0.866077 


0,499955 


0.000000 


0.00000 






290 


SMTRY3 




0.000000 


0.000000 


-1.000000 


50.58553 






290 
















REMARK 


290 


REMARK: NULL 










35 


REMARK 


^§5 
















REMARK 




MISSING RESIDUES 












d £ C 


THE FOLLOSTTNG 


RESIDUES WERE NOT LOCATED TN TTTR 










EXPERIMENT. 


fM-MODEL NUMBER: RE<?= 


RE3IDUS NftMBr-. 


C=CHAIN 






165 


IPENTIFIER; 


S3SEO«SEOUENCE NUMRKR 


; I"INSERTTON 


CODE) : 


40 




465 
















REMARK 




M RES C SSSBOT 










BJPMAPK 




MET A 




1 










REMARK 




SER A 




2 












1§5 


A5P A 


317 








45 




^65 


ARG A 


318 
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PHE 


A 


319 


























ARG A 


320 


























GLY 


A 


321 






















600 


























5 




800 


SITE 
























REMARK 


900 


SITE TDENTTFTER: CAT 


















RSMftRK 


SOO 






















REMARK 


900 


CATALYTIC RESIDUE 






















590 


























10 


REMARK 


900 


SITE IDENTIFIER! CAT 




















900 


SITE DESCRIPTTOM: 






















900 


CATALYTIC RESIDUE 






















900 






























900 


SITE IDENTIFIER: CAT 
















15 




900 


SITE DESCRIPTION: 






















900 


CATALYTIC RESIDUE 




















REM7VRK 


900 
































3 


316 SWS 


P32055 


FCL ECOLT 










1 A 


321 


K6T 


SER T.YS 


GLN ARG VAL 


PHE 


ILE 


ALA GLY HIS 


ARG 


GLY 


20 






h 


321 


MET 


VAIr <?LY 


SER 


ALA 


ILE 


ARG 


,AR<? 


GLN LEU GLU 


GLN 


ARG 










321 


GLY 


ASP VAL 


<?LV 


LEV 


VAL 


LEU 


ARG 


THR ARG ASP 


GLU 


LEU 








A 


321 




L&V LEV 


ASP 


SER 


ARG ALA 


VAL 


HIS ASP PHE 


PHR 


ALA 








A 


?21 


5BR 


AR(? 


ILE 


ASP 


GLN 


V7^ 


TYR LEU ALA ALA TOA LYS 








A 


?21 


VAL 




XLE: 


VAL 


ALA ASN ASN 


THR TYR PRO TOA ASP 


25 




7 


A 


321 


FHS 


ILE TYR 


GLN ASN 


MET 


MET 


ILE 


GLU SER ASN 


ILE 


ILE 




SEORES 


9 


A 


321 


HIS 


ALA ALA 




GLN ASN ASP 


VAL 


ASN LYS LBV 


LEV 


PHE 






? 


A 


321 


LSV 


<51iY SER 


SER 




ILE 


TYR 


PRO 


LYS LEU ALA 


LYS 


GLN 




SSQBZS- 


10 


A 


321 


PRO 


MET ALA 


<?LV 


SER 


GLU 


LEV 


LEV 


GLN GLY THR, 


LEV 


GLV 




S£QBE5_ 


11 


A 


321 


PRO 


THR ASN 


GLV 


PRO 


.im 


ALA 


ILE 


ALA LYS TLE 


ALA 


GLY 


30 




12 


A 


321 




L:fS LEV 


CXS 


OLV 


SER 


TYR 


ASN 


ARG GLN TYR 


GLY 


ARG 




SEORES 


13 


A 


321 


ASP 


TYR ARG 


SER VAL 




PRO 


THR 


ASN LEU TYR 


GLY 


PRO 




2 SORBS 


14 


A 


321 


HJ5 


ASP ASN 


PHS 


HJS 


PRO 


SER 


ASN 


SER HIS VAL 


ILK 


PRO 




SEORES 


15 


A 


321 


ALA 


LEU LEU 


AR<? 


ARG 




HIS 


GLV 


ALA THR ALA GLN ASN 






16 A 


321 


ALA 


PRO Asp 


VAL 


YAL 


VAL 


TRP 


GLY 


SER GLY THR 


PRO MET 


35 


SEORES 


17 


A 


321 


AR<? 


PHB 


LW 


ms 


VAL ASP ASP MET ALA ALA TILA SER 




SEORES 


19 


A 


321 


ILEi 


HIS VAL 


MET 


OLV 


LRV 


ALA 


HIS 


GLU VAL TRP 


LEU 


GLU 




SEOR£S 


1? 


A 


321 


ASN 


THR GLN 


PRO 


MET 


LEV 


SER 


HIS 


.-ILE ASN YAL 


GLY 


THR 






20 


A 


321 


GLY 


YAL A5P 


CXS 


THR 


ILE 


ARO 


ASP 


VTO, ALA GLN 


THR 


ILE 




SEORES 


21 


A 


321 


ALA 


LY3 V?CL 


YAL 


QLY 


TYR 


LYS 


GLY 


ARG YAL YAL 


PHE 


ASP 


40 




22 


A 


321 


ALA 


SER LYS 


PRO 


ASP 


GLY 


JUEiR 


PRO 


ARG LYP LEV 


LEV 


ASP 




SEORES 


2? 


A 


321 


VAL 


THR ARG 


LEV 


HIS 


GLN 


LBV 


GLY 


TRP TYR HIS 


GLU 


ILE 






24 


A 


321 


SBR 


LBV <?LU 


ALA 


OLV 


LEU 


AW^ 


S]PR 


THR TYR GLN TRP 


PHE 




SEORES 


2^ 


A 


321 


f.EU 


GLV A5H. 


<^LN 


ASP 


ARG 


PHE 


ARG 


GLY 








HET 


MPP 




1 




0 


















45 


HfiTOftM 




NDP NICOTINAMIDE ADENTNIE 


: DINUCLEOTIDE PHOSPHATE 
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HETSYN 



WPP NftPg 



FORMUI. 2 MPP 



QZl H23 W? PI7 P3 3-.. 



XQEMUIi 3 HOH * 109^2 OH 



HELTy 


1 


1 


MBT A 


14 


eiiN A 


25 


— ^ 














12 




2 


2 


SBR A 


44 


GLU A 


54 


1 
















HELTX 


3 


3 


ILE A 


69 


THR A 


74 


1 






















PR(? A 


76 


ASN A 


97 


1 














22 


HKLTX 


5 




SSR A 


108 


ILE A 


110 


5 














3 




6 


6 


CrlfV A 


121 


5LV A 


123 


5 














3 


HELIX 


7 


7 


GLU A 


134 


A 


154 


1 














21 




8 


9 


VAL A 


199 


ALA A 


133 


.1 














14 


HELIX 


9 


? 


VAL A 


214 , 


QliV A 


22^ 


1 














13 




10 


10 


K2S A 


2?9 


av A 


234 


1 














6 




11 


11 




253 


VAL A 


264 


1 














12 


HELIX 


12 


12 


THR A 


288 


5LH A 


292 


1 














.. 5 .. 




13 


13 


LEU A 


301 


GLU A 


314 


1 














14 .. 




1 


A 


5 VAI. A 2? 


VAL A 


32 


0 


















2 


A 




A 4 


ALA A 




1 


N 


GLN A 


4 


o 


GLU A 


30 








A , 


6 GLN 


A 59 


LEV A 


^1 


1 




<?LN A 


59 


.P 


PfiS A 


7 






4 


A 




A IPl 


LEU A 


105 


1 




LYS A 


IPl 


P 


VAL A 


59 




SHEET 


5 


A 


6 ASP 


A 157 


PRO A 


1^3 


1 


N 


ASP A 


157 


..Q, 


A 


102 




SHE&T 


6 


A 


6 ILE 


A 243 


VAL A 


245 


1 




ILE A 243 


o 


MET A 


162 




££BS£X_ 


1 


B 


2 ASM A 165 


TYR A 


1^17 


p 
















SHEET 


2 


& 


2 PHE 


A 211 


HI? A 


213 


1 


N 


LEU A 


212 


0 


ASM A 165 






1 




2 ASP A 198 


TRP A 


202 


0 
















SHEET 


2 


C 


2 ARG A 269 


ASP A 


273 


1 


W 


ARC A 


269 


P 


VAL A 


199 




SITE 


1 


CAT 


1 TYR 13 6 




















5ITE 


z 


CAT 


1 LYS 


140 






















SITE 


3 


CAT 


1 


: 107 






















CRYSTl 


10^ 


.200 


10^. 


200 


75.990 


?0t 


PO 




.op 120 


.00 P 




2 1 


6 





10 



15 



20 



25 



30 



ORIGXI 



l.OQOOOO 0.000000 0.000000 



0.00000 



PRICTZ.. 



0.000000 1.000000 0.000000 



0,00000 



PRIgX3 



0,000000 0.00000 0 1.000000 



0.00000 



SCALEl 



O.QQ95?7 0.005541 0.000000 



0,00000 



35 



SCALE2 



0,000000 o.oiioei o.oooooo 



0-00000 



40 



o.oooooo 0.000000 0.013179 



0.00000 



HEXAZM 


1 


p 




1 


55, 


.§52 


-le.Boe 


22.535,,. 


1.00 


9, 


.73 


0 


HETATM 


2 


P 


HOH 


3 


58, 


r4?4 


-10, «9 


19.740 


1.00 


13. 


.17 


P 


HBXAOM 


3 


o 


HPH 


4 


58. 


,230 


-11.715 


27.770 


1.00 


1?. 


.07 


O 


HETATM 


^ 


p 


HOH 


5 


57. 


.252 


-3.759 


30.107 


1.00 


11. 


r21 


O 


HETATM 


5 


p 


HPH 




58. 


,2?9 


-10,011 


25.527 


1.00 


15, 


.74 


o 


HETATM 


6 


o 


HOH 


7 


49. 


.32X 


6.583 


38.815 


1.00 


1?1 


.33 


O 


miATM 


7 


p 


HPH 


8 


53. 


,795 


-4,2^2 


22,464 


1.00 


10i 


.94 


P 


HETATM 


? 


p 


HOH 


10 


74. 


.65? 


2,899 


9.141 


1.00 


17, 


.80 


O 


HETATM 




p 


HPH 


11 


4?, 


,m 


0,92§ 


32,9?§ 


1.00 


22. 


.02 


O 



45 
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10 


o 


HOH 


12 


55.530 


-11.162 


29.52fi. 


, 1.00 11,39 


P 




HETATM 


—11 




HPH 


13 


75.027 


7.034 


27.353 


1.00 16,30 


0 






12 




HOH 


14 




-2.314 


11.032 


1.00 21.33 


0 




HETATM 


13 


0 


HPH 


15 


61.323 


-8.959 


2?. 657 


1,00 22.94 


P 


5 


HBrMM 


—ii 


0 


HPH 


16 


61.02? 


-11.560 


29.131 


1.00 21.24 


0 




HETATM 


1? 


0 


HPH 


17 


50,684 


5.99L 


10.130 


I1PO 15.99 


0 




HETATM 


1$ 


0 


HPH 


19 


64.506 


-6.302 


32.989 


1.00 21.05 






HETATM 


17 




HOH 


19 


57.856 


-16.3?9 


25.085 


1.00 22.86 


0 




HETATM 


19 


p 


HPH 


29 


39.979 


?6.536_ 


19.070 


1.00 21.08 




10 


HETATM 


19 


Q 


HPH . 


21 


38.042 


33.487 


21.909 


.1,QP 19.01 


P 




HETATM 


2Q 


9 


HPH 


2\ 


• 38.172 


35.775 


20,927 


1.00 33.46 


0 




HETAin 


21 




HPH 


25 


70.916 


-11.126 . 


15.244 


1.00 31.:^7 


0 




HETATM 


22 


Q 


HOH 


26 


54.205 


I?r360 


29.396.. 


1.00 35.76 


0 




HETATM 


23 


Q 


HPH 


21 


50.436 


2.654 


16.783 


1.00 12.25 


0 


15 


HETATM 


2^ 


O 


HOH 


28 


6?.6,S2.. 


19.108 


38.979 


1.00 49.77 


0 




HETATM 


25 




HPH 


2? 


56.432 


. t8,977 


19.303 


1.00 22.52 


0 




HETATM 


2§ 


0 


HOH 


30 


60.832 


3.415 


42.349 


1.00 17.39 


0 




HETATM 


27 


Q 


HOH 


31 


53.889 


-12.7p§ 


?9,764 


1.00 22.40 


P 




HETATM 




-Q. . 


HOH 


32 


37.887 


26.373 


28.058 


1.00 18.09 


P 


20 


HETATM 


29 


0 


HOH 


33 


4?. 201 


11,173 


26.967 


1,00 33.95 


0 




HETAXW 




o 


HPH 


^1 


46.762 


-0.278 


31.394 


1.00 20.63 


0 




HETATM 


?1 


P 


HOH 


35 


41.731 


27.568 


43.302„ 


1.00 27.39 


P 




HETATM 


32 


p 


HOH 


3€i 


66.927 


11.202 


29.929 


1.00 13.23 


P 




HETATM 


5? 


P 


HOH 


37 


46.834 


14.396 


40.919 


1.00 46.02 


P 


25 


HETATM 


?4 


P 


HPH 


38 


61,342 


1.P64 


43.969 


1.00 26.68 


P 




HETATM 


35 


P 


HOH 


42 


70.597 


16.422 


37.837 


1.00 19.26 


0 




HETATM 


35 


o 


HOH 


44 


72.275 


-2.093 


33.407 


l.OP 22.11 


0 




HETATM 


37 


p 


HPH 


-15 


42.685 


34.461 


33.955 


1.00 17.32 


0 




HETATM 


39 


0 


HPH . 


46 


53.480 


13 r 394 


39,364 


1,00 20.19 


0 


30 


HETATM. 


3? 


p 


HPP 


47 


5^.085 


21,757 


44.744 


1.00 33.50 


0 




HETATM 


^0 


P 


HPH 


48 


35.741 


32.691 


23.517 


1,00 19,49 


0 




HETATM 


^1 


P 


HOH 


-9? 


40.458 


36.700 


34.312 


.1.00 34,53 


0 




HETAIM 


,,. n 


_P 


HOH 


50 


75.440 


7.267 


29.948 


1.00 18.07 


Q 




HETATM 


^3 




HOH 


51 


47.476 


19,347 


20,951 


IrOO 34.16 


P 


35 


HETATM 




P 


HPH 


53 


52.937 


-16,344.. 


19.587 


1.00 25.92 


0 




HETATM 




P 


HOH 


55 


46.415 


9.073 


20.109 


1.00 31.91 


0 




Mivl/vlrl 


46 


P 


HPH 


57 


45,?12 


35.170 


36.133 


1.00 35.55 


0 




HETATM 


^7 


p 


HOH 


58 


60.247 


-2.880 


41.919 


1,00 16,95 


0 




HETATM. 


4B 


P 


HPH 


69 


64.974 


6.086 


24.501 


1.00 32.16 


0 


40 


HETATM 


49 


P 


HOH 


61 


52.103 


4.683 


4.978 


1.00 35.72 


P 




HETATM 


50 


P 


HOH 


^2 


50i9e9 


40,154 


36.463 


IrPO 39.35 


0 




HETATM 


51 


o 


HOH 


63 


44.373 


31.233 


37.336 


1.00 20.07 


0 




HETATM 


52 


p 


HOH 


64 


57.280 


27.757 


42.451 


1-po 21.74 


0 




HEXATM 


53 


p 


HPH 


65 


58 .409 


23.769 


45.517 


1.00 58.42 


P 


45 


HETATM 


5^ 


p 


HOH 


66 


68.690 


-11.764 


35.335 


1.00 57.07 


P 
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HRTATM 5 5 O HOH 67 42.746 25.153 23.465 1 . 00 71 .OS Q 

HgTATM 5 6 O HOH fifi 53, "16.157 32.292 1-00 31.71 Q 

HRTATM 57 O HOH 69 33,390 41.716 31.408 I , gO 2?.?;^ 2 

HETATM 5B O HOH 70 57t769 17-897 42.434 1.00 25.75 Q 

5 mihm 59 0 HOH 12 75,6^7 9,1^1 llt76^ 1,99 35,13 Q 

HETATM 60 O HOH 72 ^Z^S^ 33,292 44.749 1.00 46.18 2 

HCTA-M 6 1 O HOH 73 47-310 U^JOZ 34.285 1.00 31.18 Q 

HETATM 62 O HOH 74 79.660 -3.947 15-913 1.00 34.63 Q 

KKTATM 6 3 O HOH 25 ^€■929 5,343 4,550 1.00 23.14 Q 

10 HETATM 64 O HOH 2^ 73,475 12.Ji22 ZS^^AiZ 1.00 27-26 Q 

HETATM 65 o HOH 12 46,297 -6.982 3Q.Q32 l.QQ 43,41 Q 

HETATM 66 Q HQH 2fi 69.526 -3.422 40.869 l.oo 38.47 O 

HETATM 6 7 O HOH 7.2 62-Q8Q -1.448 42.803 1.00 24.60 Q 

HETATM 6 8 o HOH ftfi 65.330 19tl5P 4Q.726 ItQO 41.QQ Q 

15 HETATM 69 O HQH gl 51.775 ^6.129 37.607 1.00 25.11 O 

HETATM 70 o HQH 54.266 29,692 43.313 1,Q0 27,61 

HETATM 71 O HQH £5 73.291 -15.479 20.603 1,00 37.54 2 

HETATM 72 o HOH 8^ 34t769 21,479 29.544 1.Q9 43,97 2 

^gTATM 73 B HOH a7 37,326 24,131 29,677 1.00 24.47 , 2 

20 HETATM 7 4 O HOH 88 65,168 20.148 6.735 1,00 26.10 2 

HETATM 75 O HOH 89 59.196 12.089 13.630 1,00 25.24 2 

HETATM 7 6 o HOH 91 66.576 "6.235 4P.279 l.PQ 43,11 Q 

HETATM 7 7 o HQH 23 37,339 29.394 25.515 1.00 27.56 2 

HETATM 78 O HOH 94 52.339 -17.014 42.271 1.00 48.96 2 

25 HETATM 79 o HOH 95 40,511 32,927 31.717 l.QQ 22.46 2 

ffETATM 80 o HOH ^6 79,590 13.121 34,139 IrOO 27r?9 2 

HETATM 81 O HQH 97 65,090 15,704 34,976 1,00 18.96 2 

HETATM 8 2 Q HOH 99 94,562 2.951 27,191 1-00 35.92 2 

HETATM 83 O HOH 100 50,396 9,761 9,6^6 l-OQ 23,18 2 

30 HETATM 84 O HOH 101 67.649 -0.851 39.764 1.00 24.99 O 

HETATM 85 O HOH 102 44.001 4.293 34.315 1,00 31,13 2 

HETATM 86 O HOH 103 59.386 -5.071 Zl^Jill 1.00 29,10 2 

HETATM 87 O HOH 104 77,364 4,745 41,506 1,00 35,32 2 

HETATM 88 O HOH 105 59.034 21.201 32,414 1.00 23,43 2 

35 HETATM 89 O HOH 106 42.463 34.698 14.327 1,00 38,96 2 

HETATM 90 o HOH 107 JSLJJJ. 11^222 20,864 1.00 42.39 2 

HETATM 91 O HOH 108 76.999 8.130 25.862 1.00 32.93 2 

HETATM 92 O HOH 109 49.766 29.937 22,17? 1,Q0 42.5^ Q 

HETATM 93 O HOH 110 72.473 13.536 39.923 1,00 33, 3e 2 

40 HEIMtl 91 Q H2H IXX 64,329 -12r094 39,608 l.OQ 37,99 2 

HETATM 95 O HOH 112 60.161 16 .3B2 42.682 l.OO 35.68 O 

HETATM 9 6 O HQH 113 47,6 02 13,639 27.016 3,. 00 26,0?- 2 

HETATM 97 Q HOH 115 64.606 11 .644 40,107 1.00 30.33 2 

HETATM 98 o HQH ll£ 61.231 -15.137 27,255 — ItOO 39,76 Q 

45 HETATM 99 O HQH 117 65.324 -11.223 35.098 1,00 30.45 2 
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aETAiM aoQ n — asm — 112 56,602 17.219 44.q:^ 2 1.00 36.53 q 

HETATM 101 Q HfiH Ufl 37.564 19. 860 23.135 1.00 31.27 Q 

HECMM 1122 Q H9H 121 64.845 5.057 21.132 1.00 4S.57 Q 

HBTAIM 1P3 Q asm 122 63,391 16.801 26.B 98 1.00 3B.46 Q 

5 HETA™ 10^ Q HQH 124 12.S^7 6.134 32.6 35 1.00 31.56 Q 

HgTATM lfl5 Q USm 125 72.485 13.236 35.059 1.00 29.61 Q 

flfilflOM IM O HOH 126 65.229 3.650 44.032 1.00 36. B6 O 

HBTAIM 1Q7 Q HQH 127 37.089 7. 148 31.083 1.00 39.58 Q 

HZTAIM IPg 0 H2H 12fi 73.327 10.546 12.12 3 1 .00 34.97 Q 

10 HBTA™ 1Q9. Q HQH 125 74.450 10 .299 26.598 1-OQ 3n.fln q 

HEiaiM 110 A05* WPP A 1 67. 524 13.055 26.692 1.00 36.42 Q 

HSIAIM 111 ACS* NDP A 1 68.089 12.297 25.614 1 .00 9.30 Q 

HETAIM 112 AC^* NOP A 1 69.601 12 .124 25.858 1.00 27.73 £ 

HETATM 113 AQ1* WPP A 1 70.193 11.258 24.848 1.00 22.97 Q 

15 HETAIM 11^ AQ?* WPP A 1 70.464 13 .390 25.873 l.QQ 17.83 £ 

HETATO 115 AP3* WPP A 1 71-192 13.436 27. 066 1.00 16.11 q 

HBTATM 116 AC2* WPP A l 71,373 13.220 24.626 1.00 11.46 Q 

HgTAlM 117 A02* NDP A 1 72.623 13.886 24,655 1.00 31.96 Q 

HfilAIM 118 ACl* MDP A 1 71.510 11.702 24.656 1.00 19.02 C 

20 ' HBlAIll 112 02 WPP A 1 65.336 13,590 26.129 1.00 20.59 o 

HETAm 12P WQ5* WPP A 1 63.536 11,943 26.448 l.QQ 29.99 Q 

BBTAIH 121 Wg?* WPP A 1 6^.329 10,843 25.957 1.00 24.89 Q 

HETAin 122 NC4* NDP A 1 63,467 9.646 25.686 1.00 31.79 Q 

HETAIM 123 WP^* WPP A 1 62.837 9.337 26.908 1.00 28,82 Q 

25 iIEXA3H 124 NC3* NDP A 1 62.340 9.837 24.665 1 .00 11.50 c 

HETAIM 125 WP3* NDP A 1 ^2,8?! 9.402 23.461 l.OO 28.60 s 

HETA3M 126 WCg* WPP A 1 61.152 9 .996 25.138 1.00 28.11 C 

HEIAIM 127 N02* NPP A 1 60.881 7.662 24.715 1.00 24,30 o 

HEIAiri 129 WCl* WPP A 1 61.547 8.875 26.580 1.00 3.S.3S Q 

30 HSTATM 1?? AP2* WPP A 1 73.104 15 .069 23.823 1.00 32.96 E 

HETAin 130 AOPl NDP A 1 74.500 15.308 24.308 1.00 37.84 Q 

HETAIM 131 A9P2 WPP A 1 72.797 14.925 22.348 1.00 36.66 Q 

HETAIM 132 AOP3 NDP A 1 72.163 16,217 23.958 1.00 31.97 q 

HETAIM 133 AP WPP A 1 66.660 14.257 26,393 1.00 26.17 

35 HEIAIM 123 API WPP A 1 66.886 14.795 25.047 1.00 15.31 ^ 

HETAIM 135 AP2 WPP A 1 66,439 15.207 27,521 l.QQ 34.39 ^ 

HETATM 136 AW? WPP A 1 71,820 11 .224 23.353 l.QQ 13.63 

HETAIM 137 ACQ WPP A 2 71.10^ 11.316 22.200 i.oo 12. 41 xx 

HETATM 13P AN7 NPP A 1 71.759 IP, 935 21.161 i.oo 15.71 ^ 

"^0 HETATM 139 AC5 WPP A 1 72.933 19,313 21.710 1.00 16.17 ^ 

HETATM AC$ WPP A 1 74.053 9.657 21.140 1.00 31.35 ^ 

HEIAIM — 1^1 AW9 WPP A 1 74.165 9.464 19.819 1.00 12.59 ^ 

HETATM 112 AWl WPP A 1 75.078 9.280 21.942 1.00 17.56 ^ 

HETAIM 1^3 AC2 WPP A 1 74.971 9 .578 23.251 1.00 1.S.44 ^ 

45 HETAIM — Hi AN3 WPP A 1 74.027 10.302 23.889 i.on ?4.r? ^ 



wo 99/64618 



PCT/US99/n576 



85 







.145 


AC1 


NDP A 


1 


73.036 


10 


.653 


23.017 


1.00 17,19 






HETATM 




Iff , 


OTP A.. 


1 


61.1&3 


.13 


.106 


27,1?1 


1.00 25.47 


N 




HRTATM 


117 




NDP A 




63,112 


..11 


.16? 


27.253„ 


1.00 29,6? 


N 




HETATM 


118 


wz 


NDP A 


1 


61.937 


12 


.613 


29,1?2 


1.00 24.32 


N 


5 


HETATM 


149 


mi 


NDP A 


1 ■ 


60.598 


? 


.775 


27.10? 


1.00 23,63 


N 




KSTATM. 


150 




NDP A 


1 


60.113 


10 


.905 


26.112J 


99.00 78.36 


N 




HETATM 


151 




NDP A 


1 


5?. 970 


11.648 


27,097- 


99.00100.00 


N 




HETATM 


152 


-JJC7, 


HDP A 


1 


59.1?7 


13 


.017 


26.529- 


99.00100.00 


N 






15? 


W7 


NDP A 


1 


5?. 359 


13 


.703 


25.972- 


99.00100.00 


N 


10 


KSTATM . 


151 


NN7 


NDP A 


1 


57.207 


13.400 


26.?12- 


99.00 84.38 


N 




HETATM 


155 


NC4 


-NDP A 


1 


59.112 


11 


. 146 


2B.137Z 


99.00100.00 


N 




HETATM 


15$ 


W5 


NDP A 


1 


59.??,2 


3 


■ ?63. 


29.751- 


99.00100.00 


N 




HETATM 


157 


NC6 


NDP A 


1 


5?.?51 




.266 


28.147- 


99.00100.00 


N 




ATOM 


156 


N 


A 


3 


76.227 


—-5. 


. 632 


44.315 


1.00 61.49 


H 


15 


ATOM 


15? 


£A 


LT3 A 


3 


76-152 


-1. 


u302 


43.684 


1.00 58.00 


c 




ATOM 


160 


C 


LYS A 




75.985 


,.t1. 


.121 


42.171 


1.00 52.79 


c 




ATOM 


1^1 


0 


A 


3 


76.S21 


-1 


.737 


11.11? 


1.00 44.76 


o 




ATOM 


162 






3 


77,35? 


-3 


.117 


11.039,.., 


1.00 59.74 


C 




ATC»4 


16? 




IiY? A 


3 


77,011 


-1, 


.?11 


11.311 


1.00 50.87 


c 


20 


ATC« 


151 


CP 


LYS A 


3 


79.208 


-1, 


rl61 


44.894 


1.00 61.21 


C 




ATC»4 


1$5 




LYS A 


? 


77.955 


-0, 


,377 


16.196 


1.00100.00 


C 




ATWl 


1^6 




LY5 A 


3 


79t957 


-0, 


rIOl 


47.343 


1.00 70.61 


n 




ATOM 


1^7 


N 


GLN A 


4 


74.746 


-1. 


r212 


11.717. 


1,00 15.15 


N 




ATOM 


159 


CA 


?ftN A 


1 


7itioe 


-1, 


r326 


10.317 


1.00 37. IB 


c 


25 


ATCM 


169 


Q 


GLN A 


4 


74.983 


-3. 


,166 


3?. 561 


1.00 34.93 


c 




ATOM 


170 


P 


GLN A 


1 


75.127 ,. 


-2. 


.05P 


40.087 


1.00 28.48 


o 




ATC»1 


171 


Cfi 


GLN A 


4 


72.915 


-1, 


-115 


10.221 


1.00 34.65 


C 




ATC»4 


172 


CP 


GJM A 


4 


72t156 


-5 , 


^51 


10.591 


1.00 31.82 


c 




ATOM 


173 


CD 


GLN A 


4 


72.570 


-6, 


,798 


39.405 


1.00 79.25 




30 


ATOM 


171 


OBI 


GLN A 


4 


72-165 


-6. 


152 


39,286 


1.00100.00 


C 




ATCM 


175 


NE2 


^J^ A 


1 


73.206 


-7, 


?25 


39.623 


1,00 80.24 


N 




ATOM 


176 


N 


ARG A 


5 


75.475 


-3 t 


1?5 


39.375 


1.00 27.16 


N 




AT9M 


J77 


CA 


AR<? A 


5 


76,116 


-2r 


516 


37.193 


1,00 39.16 


c 




ATOM 


176 


C 


AR<5 A 


5 


75, 19;^ 


-2. 


019 


36,133, .. 


1.00 38.22 


C 


35 


ATOM 


17? 


Q 


ARC A 


5 


71,?3e 


-2. 


698 


35.139 


1.00 32.11 


o 




ATCW 


leg 


CB 


ARG A 


5 


77.398 


-3.163 


36.826 


1.00 41.76 


C 




ATOM 


181 


CG 


ARC A 


5 


78,6?2 


"2, 


?51 


37.663 


1.00 37.34 


c 




ATCM 


192 


CI? 


ARC A 


5 


80.015 


-3. 


236 


36.976 . 


1.00 32.99 


c 




ATOM 


183 


HE 


ARC A 


5 


91.036 


-2. 


203 


37.125 . 


1,00 25.71 


N 


40 


ATWl 


,181 


ca 


ARG A 


5 


81.617 


-J. 


499 


36.169 


1.00 32.53 


c 




ATOM 


195 


NHl 


ARG A 


5 


81,2?3 


'I' 


704 


31r?01 


1.00 40.07 


N 




ATOM 


196 


NH2 


ARC A 


5 


92,516 _ 


-0, 


551 


36.474 


1.00100.00 


N 




ATO! 


197 


N 


VAL A 


6 


74.743 


-0. 


773 


36.65? 


1.00 32.08 


N 




ATWl 


le? 


CA 


YAL A 


6 


73.715 


-0. 


092 


35.881 


1.00 28.89 


c 


45 


ATOH 


19? 


C 


YAL A 


6 


71.161 _ 


1. 


021 


34.897 


1.00 29.37 


c 



wo 99/64618 



PCT/US99/11576 



86 

AXSai O VAL A 6 74.745 2. 041 35,274 1.00 22.50 O 

AlCM 12J — CB VM A g 72,577 0.378 36,813 i.OQ 7^.^7 £ 

ai£2M ISZ CGI VM A g 71.366 0.960 36. 006 1.00 20.29 £ 

Ig3 Cg2 VM A g 72.108 -0.852 37.644 1.00 18.45 £ 

5 ami fS&^ 2 0.749 33.615 1.00 22.92 M 

aiesti las gA phe a 2 74.267 1 .710 32.573 1.00 27.15 £ 

AISM EHE-A 2 72,?7$ 2.423 32.192 l.OO 20.24 £ 

AISl 1S2 Q EHB-^A 7 71.??^ 1,788 31.815 l.OO 20.71 o 

torn 12a £B PHE A 7 74.864 1.004 31,374 1.00 18.98 C 

10 Kim 199 eg FHE A 2 74,91^ 1.836 30.115 1.00 21-83 £ 

AlSfcl 20fi gPl PHB A 2 75.521 3,087 30.108 1.00 19.36 C 

AXCM 2Sa CD2 PHB A 7 74.483 1.284 28.886 1.00 23.50 £ 

him ZStZ CBl PHB A 2 75.614 3.828 28-902 1.00 27.52 £ 

him 2i23 gB2 PHE A 2 74.548 1 .996 27.685 1.00 19.33 £ 

15 AICM ZM £2 PHE A 2 llulZh 3.255 27. 673 1.00 18.59 £ 

ATCM 205 N IM..A 8 72.959 3 .727 32.454 1.00 18.75 H 

him ZM CA ILE A 8 71.644 4.588 32.112 1.00 T4.25 £ 

him 207 C U^Sl^ S 72.337 5.351 30.909 1-00 11.22 £ 

him 2ilS Q I1E_A B 73.259 6.165 30.9 98 1.00 17.76 O 

20 AlSai 2112 CB ILE A 8 71.507 5.605 33-212 l.OO 14.15 £ 

him 210 CGI ILE A 8 71.356 4.949 34.582 1.00 8.24 £ 

him 21i Cg2 A 8 70.183 6 .342 32.874 1.00 16.85 £ 

him 212 CPl ILE A 8 71,091 5.961 35-707 1.00 10.32 £ 

A153M 2X3 N hLl^ 2 71,89^ 4.906 29.752 1.00 16.42 M 

25 ATW 214 CA AIA A 9 72,256 5.559 28.513 1.00 18.74 £ 

him 215 C ALA K 2 71.530 6.913 28.511 1.00 28.45 £ 

him 21fi Q AIA A 9 70.411 7.032 29-045 1.00 22.39 £ 

him 212 Cfi AIA A 9 71.808 4.731 27.311 1.00 14.43 £ 

him 216 N GLY A 10 72.199 7.922 27.940 1.00 20.06 H 

30 ATCai 215 CA <?LY A ^0 71.706 9 .284 27.911 l.OO 18.62 C 

hSm ZZS. £ <?LY A 12 71.407 9.819 29.305 1.00 16.40 £ 

hSm 121 O PLY A 3.9 70.379 10 .448 29.481 1,00 17-36 O 

him 2Z2 II gl? A 11 72,295 9.581 30.2 72 1.00 10.32 U 

him Z21 ^ HIS A 11 72.068 9.966 31.6 88 1.00 13.90 £ 

35 him 224 C HIS A rX 72.008 11.504 31.91 6 1.00 21.52 £ 

him 22? Q HIS A 11 71.700 11.994 32.983 1 .00 13.22 Q 

AieSCa 225 CB HIS A li 73,153 9.350 32.581 l.OO 14.88 £ 

him 227 eg HIS A 11 7^.502 9.948 32.326 1.00 23.73 £ 

ATOM ZZSl NPl HIS A 11 75.239 9.648 31.197 1.00 24.90 K 

40 ATOM 222 CD2 HIS A ll 75.167 10.952 32.956 1.00 16.35 £ 

him 2iQ QS; HIS A 11 7^,317 10.407 31.170 l.OO 22.54 £ 

him 2iU ME2 HIS A 11 76.271 11.240 32.197 1.00 17.56 M 

him 232. rr Afig a u 72.310 12.288 30.908 1.00 22.31 ^ 

him CA Afig A 12 72.147 13.693 31,122 1.00 18.90 C 

45 AlOatl 214 C ARg A 12 70.651 14.244 30.495 1 . no 7fi-34 C 
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£ ASM A 40 77.798 6.649 20.308 1-00 24.53 Q 

aZSatl 4£i 2 ASM A IP 78.328 5.720 19. 688 1.00 19.96 Q 

BOSU 162 CB ASH A 19 79,13P 8.367 19,216 l.oo ifl.45 £ 

AICM l£a CG ASN A 40 78.054 8. 727 18.225 1-00 42.19 £ 

5 AlflM IM gPl ASM A 10 70.327 9,093 17-080 1.00 38.89 Q 

AIOl IfiS — HP2 ASM A 10 76,827 8.730 18.6 97 1.00 23.71 H 

AICM 16g N IiSy A U 76.513 6.622 20, 754 l.OO 21.0B li 

AlSaa ±£2 CA LEV A 11 75.^19 5.465 20. 650 i.oo t5.03 £ 

AlCM Ififi C IiBP A 13 75.225 5.068 19.2 13 1.00 18.22 £ 

10 AICM 4fi2 Q I.gP A 41 74.681 3 .971 18.980 1.00 15.72 Q 

AXCM aifl CB IiEV A 11 74.426 5.705 21. 532 1-00 IS.BS g 

AIQM 121 eg LBV A 41 74.822 6.029 22.974 1.00 21.90 Q 

Aiai 172 Cgl IiBV A 11 73,^01 6.413 23.749 1.00 20.59 £ 

Aiai 1L2 CD2 LEU A 41 75.481 4,796 23.609 1.00 17.97 Q 

15 AICM 421 H LEJLA 42 75.542 5.9 16 18.238 1.00 12.45 U 

AIS2b 425 ffl LEV A 42 75.256 5.607 16.831 1.00 15.99 C 

Aiai ,476 Q, LEJL^a 12 76.290 4.680 16. 280 1.00 26.18 Q 

AICM 422 Q LBJLA 42 76.066 4 .039 15.257 1.00 22.41 Q 

AICM 42fi CB LBV A 42 75.282 6.873 15.984 1.00 17.85 Q 

20 AICM 425 I^gg A 42 74.180 7.854 16.399 1.00 30.70 Q 

AICM mi CDl LEU A 42 74.318 9.184 15.704 1.00 24.31 £ 

AII3M 4fil CD2 LEV A 42 72.764 7 .241 16.208 1.00 31.13 £ 

AICM 182 M AgP A 1? 77.1$? 4.705 16. 911 1.00 26.87 U 

ATai 4£3 gA ASP A 12 78,579 3.875 16.486 1.00 19.29 Q 

23 AZfiM 4fi4 Q ASP A 43 78.583 2.519 17.163 1.00 13.33 £ 

AISM 4fi5 Q ASE_A 43 79.051 2.348 18.297 1.00 18.75 Q 

AISM 48£ gg AgP A 43 79,870 4. 580 16.776 1.00 31.06 £ 

Aiai 482 CG ASP A 43 81,083 3.758 16.380 1.00 30.68 £ 

AIS3M 488 PDl AgP A 13 80.971 2 .551 16.082 l.OQ 32.36 Q 

30 Aietl 415 0P2 ASP A 43 82.187 4 .308 16.499 1.00 37.83 O 

AlCM 45fi n 3ER A 11 78.13? 1.544 16. 377 l.OQ 16.89 U 

Aiai 4S1 CA 3SR A 44 77.978 0.173 16.789 1.00 17.67 £ 

AIS3M 422 C SSE^ 44 79.237 -0.463 17. 392 l.OQ 20.40 £ 

AlfiM 423 O SER A 44 79,206 -1.126 18.444 1.00 26.27 O 

35 Aicai 424 CB SER A 44 77.504 -0.617 is. 581 1.00 13.85 £ 

425 Q<? $ER A 14 76.800 -1 .740 16.063 l.OQ 43.83 Q 

A1<2M 496 N ARG A 45 80.335 -0.301 16.682 1.00 15.63 M 

AISM 1?? CA ARg A 15 81.616 -0.788 17.154 l.OQ 19.94 £ 

Aliai 428 C ARC..A 15 81.910 -0 .225 18.521 l.OQ 29.48 £ 

40 AIQfcJ 422 Q ARg A 45 82.244 -0 .937 19.457 l.OQ 27.65 Q 

AISM ififi CB ARg A 45 82.684 -0 .261 16.203 l.OQ 27.46 £ 

A12M 5ia eg ARG A 15 83.463 -1.338 15.495 l.OQ 92.03 £ 

AIQtl afi2 CD ARG A 15 84.854 -1.418 16.077 1.00100.00 £ 

AlSai 5113 NE ARg A 45 85.636 -2.533 15.527 1.00100.00 fc{ 

45 AlSaW 5il4 QZ ARg A 45 86.092 - 3.570 16.236 1. 00100.00 £ 
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AliSM C PHfi A 51 BO. 646 -4.603 26.61 2 1.00 34.01 £ 

aiOl 2 PHg A 51 80,550 -5.401 27,59 0 1.00 25,28 2 

ailSM aA2 £fi PHE A 51 78,389 -3.898 25,751 1.00 22.63 £ 

M21 ^ eg PHE A .51 77.156 -3,140 26.17 0 1.00 27,58 £ 

5 MCM ^ CDl PHE A 51 76.426 -3,525 27.280 1.00 21.78 Q 

AlflM 5i5 CP2 PHE A £1 76.663 -2.100 25. 380 1.00 19.55 £ 

AlflM 556 CEl PHS A ..51 75.267 -2.796 27.662 1.00 28,34 £ 

Ami 557 Cg2 PHE A 51 75.492 -1,403 25.734 1.00 14.47 £ 

AlSai S5fl EZ 3EHE-A 51 -1.744 26.878 l.OO 14.55 £ 

10 AlCM 553 N ALA A 52 81.576 -4.706 25.659 1.00 26,43 

AICM 5£fl CA ALA A 52 82.587 -5. 793 25.714 1,00 29.44 £ 

AICM 5£1 C AIA A 52 83,687 -5.560 26.768 1.00 43.76 £ 

AIQl 5fi2 Q ALA-A 5j2 84.502 -6 .446 27.022 1.00 40.33 Q 

AlSai 56? CB ALA A 52 83.228 -6. 049 24.344 1.00 24.25 £ 

15 A1S23EI 5fiil U SBR A 5? 83.702 -4.362 27.385 1.00 31.96 ^ 

AlSai 5^5 £A SSSUSk 53 84.705 -4.090 28.377 1,00 21.06 £ 

AlSai £6£ S fifiB..A 53 84.196 -3 .625 29.709 1.00 26.41 £ 

AISM 5^2 — Q SBR A ^? 81.9g5 -3.492 30.611 i.oo 36, 12 o 

AXSM 5£a CJB SEB^^a^ 53 85.709 -3.OB8 27.843 1.00 14,22 £ 

20 aiSai 512 OG SER A 53 85.140 -1.807 27.790 1.00 56,90 Q 

ATCM Slfl K QLV A 51 82.892 -3.431 29.874 1.00 22,38 n 

AISM 521 CA QLV A 54 82.380 -2.893 31.139 1.00 17.27 £ 

AlSai 522 C <?LV A 54 81.584 -3 .735 32.118 1.00 26.32 £ 

AICM P QLV A 51 81.229 -3, 281 33.191 1.00 37,43 Q 

25 AICM S2i Cfi — QLV A 54 81.677 -1.563 30.906 1.00 27,30 £ 

AICM 525 Cfi QLV A 54 82.573 -0 .543 30.262 1.00 44.77 £ 

AECM 52£ on fiLBLjL 5i 83.669 -0.142 31.194 1.00 86,31 £ 

aiCM 577 OEl GLU A 54 83.392 -0.232 32.428 1.00 50.11 O 

A12M 578 OE2 GLU A 54 84.785 0.198 30.692 1.00 50.99 O 

30 ACSM 522 — U AR& A 5.5 91.268 -4.971 31, 804 i.oo 29.63 n 

AISM 580 CA ARG A 55 80.636 -5.748 32.854 1.00 33,32 £ 

AISM 511 £ ARg A 55 79.347 -5 ,149 33.378 1.00 38.45 £ 

AICM 512 Q hRQ A 55 79,214 -4.897 34.576 1.00 40.18 o 

AISM 513 Cfi ARg A 55 81.621 -5.875 34.045 1.00 57.61 £ 

35 AISM eg ARg A 55 82.666 -7.028 33.960 I.QOIQO.OQ £ 

AICM 515 CD ARG A £5 82.805 - 7.805 35.305 I.QOIQO.OQ £ 

AICM 51i NE ARg A £5 82.838 -9.270 35.146 1. 00100.00 ^ 

AICM 512 ARg A 55 83.206 -10 .129 36.102 1.00100.00 £ 

ATCM 515 Wl ARQ A 55 83.583 - 9.681 37.301 1. 00100.00 K 

40 NH2 ARG A 55 83.20S -11.440 35.855 1.00100-00 H 

AICM £2fi n ILE A 5^ 78.367 -5.029 32.491 1.00 42.25 ^ 

AlSai 521 ILE A 5^ 77.064 -4.434 32.794 1.00 25.49 £ 

AICM 522 C ILE A 5^ 75.982 -5.474 33.244 1.00 20.18 £ 

AICM 523 Q ILE A ^ 75.897 -6.579 32.704 1.00 24.74 O 

45 AICM 525 £fi ILE_.a &6 76.672 -3.512 31.531 1.00 26.89 £ 
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CGI ILg A 56 77.643 -2.301 31.44? 1.00 18.30 Q 

aicM aa^ mz tlb a afi 75,214 -3.016 31-549 i.oo i^-84 ^ 

AIQM CDl ILE A 56 77.998 -1.936 30 .026 1.00 60.47 £ 

AIS3M 5jafl U ASP A 57 75.166 -5.133 34,2 37 1.00 16.84 

5 aiJ3M 5?? CA AgP A 57 74.040 -5.999 34.630 1 . 00 16.33 £ 

AXSaa fiJDlfl — C A5P A 57 72.676 -5.451 3 4.123 i.oo 7h.4q £ 

AISM 601 P ASP A 57 71.836 -6.198 33.657 1 .00 7^.^0 Q 

A12M 6Ii2 ASP A 57 74.009 -6.194 36.164 1.00 16.94 £ 

A1S2M §Sn CG ASP A 57 75.369 -6,720 36.703 1.00 34.27 Q 

10 aiai §sis — opi asp a 57 75.875 -7,729 36.i4i 1.00 3i.76 q 

AlSai fiiiS QD2 ASP A 57 76.040 - 6.007 37.499 1.00 28.36 Q 

aiCM 606 N gLK A 5g 72.443 -4,152 34.220 1.00 28.91 M 

AICM 6fi2 CA gLN A 59 71.193 -3,590 33.755 1.00 75.68 c 

AICM iilfl C gLN A &fi 71.425 -2 ,364 32-881 1.00 23.21 Q 

15 Ali3M Q gLK A 59 72.403 -1.620 33.0 67 1.00 18.16 Q 

AICM filfl CB gLW A 59 70.342 -3.151 34.9 46 1.00 33.14 £ 

AICM fill £S SUL^A Sfi 69.798 - 4.241 35.807 1.00 30-00 £ 

Ami 612 CD gLW A 59 69.226 -3 .712 37.105 1.00 27.18 £ 

ATPW 613 QBl gLM A 5.9 68.722 -2.601 37.161 1.00 31.20 Q 

20 614 NE2 GIJJ A 58 69.455 -4.436 38.186 1.00 16.89 

AI21 615 n VAIi A 5? 70.496 -2.138 31.9 61 1.00 18.35 JJ 

ATCM 616 CA VAL A 5? 70,562 -P,g99 31.045 1.00 15,59 £ 

AieSEi 612 £ VAIi A 5? 69.238 -0,240 31.0 39 1.00 26.28 £ 

AIS21 61S £2 iffiIi..A 55 68.178 -0 .820 30.762 1.00 19.51 Q 

25 AlflM 615 CB VAL A 5? 70.707 -1.456 29.601 1.00 15.32 £ 

Aiai §2^ Cgl YAIf A 5? 70.477 -0.274 28.649 1.00 11.93 £ 

AISM 621 <;;g2 YAL A 59 72,089 -2.111 29,364 1.00 15.83 C 

AISSM 622 U TTO A 60 69,306 1.064 31,293 1.00 21.71 jj 

AlCSi 623 CA TYR A fifi 68.113 1 .927 31.197 1.00 21 .40 £ 

30 Alffll 624 C TYR A 60 68,28? 2.756 29. 928 1.00 18.69 c 

Ami 625 Q TTO A 6fi 69.250 3.532 29.796 1-00 15.51 Q 

AlSaa 626 £B JXE^ 6fi 68.021 2 .817 32.413 1.00 17.24 £ 

Ami 627 CG TYR A 60 67t19? 2,131 33. 658 i.oo 19.71 £ 

Ami 629 Cgl TW A 60 69,3^5 1.583 34,586 i.oo 21. 14 £ 

35 Ami 629 CD2 TYR A 60 66.154 2.223 33,991 1.00 20.16 £ 

Ami 63Q CBl TYR A 60 67.835 1.080 35.794 1.00 19.11 £ 

Ami 611 CE2 TYR A 60 65.648 1 .698 35.163 l.QQ 10.77 £ 

AXai 6i2 C2 TYR A 6J2 66.476 1.094 36.054 1.00 20.07 £ 

Aiai 623 — CH TYR A — 60 65,921 0.565 37.248 i.oo ifi.04 q 

40 AlCSa 631 N LEV A 61 67.491 2. 452 28.916 1.00 17.46 U 

AliJM 635 £a I>EU A 61 67.685 3.053 27.585 1.00 20.17 £ 

AlflM 616 £ LEV A 61 67.003 4.412 27.409 1.00 23.36 £ 

Ami 6?7 9 LEV A 61 65,925 4.526 26.7 99 1.00 14.86 £ 

AISaM 625 Cfi LEy_.A 61 67.267 2.060 26.485 1.00 14.78 £ 

45 aiai 612 C55 LEV A 61 68.117 2.142 25.208 1.00 15.52 C 
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&I£3M fti5 Cg2 ILB h 91 67.B41 -5.316 21.84S 1-00 11-31 £ 

hJSXL fi.fii ILB A 91 66.320 -2.648 21-9 07 1-00 16.23 £ 

Aliai HI? A 92 71.446 -4.963 20.78 5 1.00 24.12 M 

AlfiSM ft£fl CA Hlg A 92 72.293 -6.015 20.2 43 1.00 26.71 £ 

5 AIQM 969 g HIS A 92 73,609 -6.251 21.071 1.00 29,30 Q 

Aiai aiiJ O HIS A 92 73.983 -7.366 21,443 1.00 18.58 Q 

AlflM aii CB HIS A 92 72.561 -5.682 18,7 75 1.00 22.23 Q 

AISSM ftZ2 eg HIS A 92 , 73.366 -6.720 18.077 1.00 26.32 Q 

AlCM 873 HPl Hlff A 92 72,798 -7.711 17,307 1.00 27.19 u 

10 AICM 871 CD2 HIS A 92 74.699 -6. 978 18,106 l.OQ 21.95 £ 

aiCM 975 CSl HIS A 92 73.755 -8.4B7 16,8 26 l.OQ ?3.fifi £ 

AI£3M aZfi WE2 HIS A 22 74.918 -8.062 17,296 i.on i7,:^fi y 

AICM 622 H ALA A 93 74.328 -5.1B7 21.3 33 1-00 15.66 H 

AlfiM B7B CA ALA A 93 7$.53p -?.3P1 22.110 l.oo n.flB q 

15 AlfitS 812 E ALA A 23 75.222 -5.900 23,512 1.00 28.78 £ 

Al£3tl BfiiJ £2 ALA-A 22 7i^_212 -6.790 24.037 1.00 25.23 O 

AIS2M ftfil CB ALA A 9? 76.139 -3.959 22.221 l.OQ 6.30 C 

^lOl 832 — M ALA A 9^ 24. -5, 24.113 l.oo 18,82 h 

Aien afij — ala a 24 73.777 -5.971 25. 399 1.00 15. 61 q 

20 AISM 814 C ALA A 94 73.593 -7 .503 25.301 1.00 28.39 Q 

Alflfcl 855 Q ALA A 24 74.133 -8.263 26.099 1.00 21.67 Q 

AlfiSM 8fijg CP ALA A 9^ 72.449 -5.279 25,911 1.00 18.46 Q 

AICM 887 H HIS A 95 72.814 -7.966 24,329 l.OQ 26.35 U 

AlfiM 888 OA HIS A 95 72.551 -9.396 24.271 1.00 24.89 £ 

25 AIS3M 889 C HIS A 95 73,845 -10.176 24,140 1.00 22.81 Q 

AIQM 890 O HIS A 95 74,077 -11.136 24.865 1.00 21.44 Q 

aXai B91 CB HIS A 95 71,571 -9,778 23.12 9 1,00 22.39 Q 

Ali2M 892 eg HIS A 95 71.554 -11.250 22.8 31 l.OQ 28.73 C 

Aiai 8.23 ml HIP A 95 70.97? -12,182 23.682 1.00 22.83 N 

50 iJlSm 824 HIS A 95 72.159 -11.964 21.8 45 1.00 25.22 C 

8^5 CSl HIS A 95 71.171 -13 .397 23.196 l.OQ 22.72 £ 

896 ^2 HIS A 95 71,911 -13.296 22.1 01 1.00 24.80 U 

822 M_ gMf A 96 7^.70? -9,658 23.281 1.00 19,97 H 

AlSai 128 CA gLN A 9^ 75.960 -10.299 22.917 1.00 22.27 C 

55 AXS3M 899 C gLH A .9..^ 76,677 -10.353 24.086 1.00 26.58 £ 

Aiai 9M Q SLfcLA 26 77,836 -11.093 24.088 1.00 24.17 Q 

AlSai 201 eg gLK A 9e 76,612 -9,492 21.818 1.00 23.38 Q 

AlSai 2112 Cfi GTM A 96 77.043 -10.299 20.596 l.OQ 61.06 C 

AIS21 2a3 on fildtLA 26 78.033 -9.557 19.675 1.00 75.83 Q 

^0 AI£2fcI 2114 OSl GLN A 26 78.999 -8.941 20.131 1.00 56.89 Q 

Aiai 225 WE2 gLN A 26 77.815 - 9.668 18.366 1.00100.00 ^ 

AISM 906 U ASN A 97 76.652 -9.500 25.060 1.00 22.15 ^ 

AliSM 2il2 gA ASN A 22 77,537 -9 .536 26.208 1.00 14.74 Q 

AlSfcl 208 C ASN A 97 76.732 -10.022 27.387 1-OQ 29.78 C 

^5 AIS21 2fi2 <2 ASN A 22 77.049 -9-762 28.564 l.OQ 27.09 O 
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dXm aifl CB MM A 27 78.241 -8.201 26. 462 1.00 12.93 £ 

AlCM 911 CG ASM A 97 79.260 -7.897 25.407 1.00 24.91 C 

AICM 212 WX A3N A ?7 80.331 -8 ,518 25.375 1.00 57.17 Q 

aiJJM 213 mZ ASM A 22 78.839 -7.135 24.3 92 1.00 34.88 M 

5 AlfiM 214 M AgP A 9B 75.666 -10.732 27.055 1.00 27.98 M 

AlSai 215 CA ASP A 2fi 74.907 -11.361 28. 089 1.00 29.25 C 

Alfib 916 C AfiJE^a 2fi 74.400 -10.379 29.1 64 1.00 37.53 Q 

A1S21 217 Q AgP A 99 7i,$Q$ -10,€?^ 30.367 l.OO 36.42 Q 

him 21B CB A3P A 23 75.791 -12.450 28.700 1.00 36,37 £ 

10 AIS2M 212 eg ASP A 99 75.016 -13.712 29.053 l.QQ fiB.62 Q 

AlfiM 22^2 QVl ASP A 99 73.775 -1 3.749 28.877 1.00 82.53 O 

AlCM 221 PP2 A3E. A 2fi 75.656 -14 .670 29.542 1.00100. 00 Q 

AlfiM 922 W VAT. A 99 73.879 -9-235 28.730 1.00 27.13 H 

AIi3M 223 CA VAIi A 22 73.157 -8 .351 29.635 1.00 21.57 £ 

15 ATOM 22i C VAL A 22 71.706 -8.668 29.530 1.00 16.15 C 

AI2M 225 Q VAL a 99 71.159 - 9.088 28.422 1.00 19.47 O 

him 2Z§ gP VAL A 22 73.264 -6.900 29.206 1.00 24.18 £ 

AICM 222 CGI VAL A 22 72.517 -6.015 30.198 1.00 14.58 £ 

AISM 22J W2 VAL. .A 22 74.720 -6,515 29. 225 1.00 30.10 £ 

20 225 U ASN A 19Q 71.149 -9.262 30.662 1.00 17.39 U 

AlSai- 22fi OA AgN A IPQ 69.852 - 9.925 30.613 1.00 25.77 £ 

AISM 211 £ ASH A IQQ 68.648 -9 .034 30.910 1.00 24.95 £ 

A12M 232 £ asm a lOO 67.498 -9.377 30.582 1.00 20.88 Q 

AIQM 223 C^ ASN A 100 69.846 -11.157 31.527 1.00 14.98 £ 

25 AISM 224 eg ASN A 109 68.724 -12.112 31.180 1.00 20.38 £ 

AIQM 935 P Dl A3H A IQQ 68.737 -12.709 30.100 1.00 29.59 O 

A12M 236 WI)2 ASH A IQO 67.716 -12.240 32.076 1.00 16.35 U 

AIQU 937 N LTS A IQl 68.941 -7.923 31.584 1.00 17.91 M 

him 238 OA LYP A IQl 67,970 -6.916 31.994 1.00 25.43 C 

30 iOm 939 C LYS A 101 68.107 -5.510 31.323 1.00 25.29 £ 

him 21fi £ LYS A IQl 69.151 -4 . 850 31.377 1.00 19.88 Q 

him 211 £fi LYS A 101 67.996 -6.807 33.521 1.00 29.28 £ 

him 212 gg LYS A IQl 67.464 -8 .054 34.205 1.00 9.31 £ 

him 243 CP LYg A IQl ^7,218 -7.719 35.668 1.00 38.93 £ 

35 him 214 CE LYg A IQl 66.206 -6.569 35.885 1.00 13.38 £ 

him 215 HZ LYS A 101 64.750 -7.006 35.825 1.00 15.26 fi 

him 21£ U LEU A 1Q2 67.013 -5 .043 30.732 1.00 22.22 U 

Ami 212 CA LEU A 102 67.003 -3.744 30.092 1.00 15.40 £ 

him 9^9 C LEV A 1Q3 65.612 -3.115 30.156 1 .00 18.55 £ 

40 ATPM 212 Q LEV A 10:^ 64.590 -3.811 30.102 1 .00 18.92 Q 

him 2S0 Cfi LEV A 1Q2 67.465 -3 .898 28.636 1.00 11.23 £ 

him 251 gg LEV A IQ? 67.553 -2 .711 27.651 1.00 15.51 C 

him 952 CDl LEU A 102 68.628 -2.985 26.559 1.00 9.65 £ 

ATOM 9$? CP2 LEV A 1Q2 66.162 -2.407 26.995 1.00 13.10 £ 

45 him 954 N LEU A 103 65.595 -1.798 30.318 1.00 17.05 M 
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AlflM CPl LEV A 111 40.991 8.797 T^.^Tn^ l.OO 49.^9 c 

AlflM CP2 LEU A 111 41.139 10.512 15i.^Q Q 1.00 26.85 Q 

AlSaa lfll2 LI AIA A 11? 43.103 6.473 lfl.5 27 1.00 29.00 U 

AliJM Ifllfl CA AIA A 115 42.920 5.446 19.5 28 1.00 25.66 £ 

5 AlflM 104? C ALA A 115 41.722 5.727 20.454 1.00 2a.76 £ 

A12M 1P5Q Q ALA A 11? 41.364 6.855 20.682 l.QQ 24.12 O 

A12M lim CB AM A 11? 11.177 5.272 20. 326 1.00 16.96 c 

AIS2M lfll2 M LXS A 11$ 41.137 4 ,675 20.998 1.00 30.21 ^ 

AISM IflSii CA LYS A 11$ 1g,03$ 4.792 21. 928 1.00 25.85 C 

10 AIQM 1254 C LYS A 116 40.66 8 5.248 23,195 1,00 14.1fl Q 

AlSaa 12^5 Q LYS A 116 41. 750 4.781 23.535 1.00 23.51 Q 

AISM lg?e LYS A 11$ 3?.369 3.415 22.116 i.oo 22.05 £ 

AIQM IIii2 eg LYS A 116 39.053 3 .032 23.524 1.00 55.38 £ 

AIOM 11155 on LYS A 116 37.963 1.955 23.549 1.00100.00 £ 

15 MOM QE LYS A 116 37.120 1.953 24.835 1.00100.00 £ 

AI2M 1960 Ii2 LYS A 116 35.767 1.310 24.630 1.00100. 00 u 

Axsai — ia£i — u gL^y a ii7 40.021 6.208 23.856 i.oo ia.23 a 

hJXSi CA giH A 117 40.456 6.757 25.180 1.00 21.01 £ 

hJm 1063 Q <?LN A 117 39.695 6.178 26.383 1.00 30.96 £ 

20 1061 Q gLN A 117 38.483 6.009 26.345 1.00 27-66 O 

hJSSl 1065 £fi gLN A 117 40.215 8.263 25.179 1.00 11.32 £ 

hXSSl 1066 C2 gLN A 117 40.849 8.912 23.948 1.00 12.12 £ 

A12M 1032 CE gLH A 117 42.404 e.823 23.954 1.00 24.10 £ 

AISM 10£8 QBl gLN A 117 43.041 8. 628 22.896 1.00 47.88 Q 

25 AIS211 NB2 <?LW A 117 43.001 8 .953 25.131 1.00 14.24 U 

aisai — 1070 n pr9 a iib 40.374 5 .992 27.499 i.oo 30.02 k 

Aiai 1021 Ca PTO A 118 41.826 6 .194 27.655 1.00 26.44 £ 

aiSai 1022 Q PRO A 119 42.450 5 .050 26.899 1.00 24.37 £ 

ATOM 1022 Q PRQ A 119 41.792 4.027 26.726 l.QQ 25.34 o 

30 iiXm 1071 £B PRP A 119 42.055 5.994 29.167 l.QQ 23.89 £ 

AlCM 11125 £2 PRO A 119 40.847 5.240 29.654 l.QQ 23.20 £ 

aiSM 1076 PRO A 119 39.695 5.519 28.709 l.QQ 15.79 £ 

aiS2M 1072 U MET A 119 43.684 5.228 26.432 1.00 16.00 U 

AISM 1020 CA MBY A 119 44.372 4 .215 25.644 1.00 10.80 £ 

35 AIQM 1022 C MET A 11? 45.062 3 .083 26.444 1.00 23.61 £ 

AlCd 1090 Q MET A 119 16.013 3.261 27.209 1.00 18.02 Q 

AIGM IfllO Cfi MET A 119 45.384 4.894 24.791 1.00 13.52 £ 

1012 eg MET A 119 44.801 6.014 23.989 1 .00 18.52 £ 

AXCM 1002 SP MET A 119 46.157 7.054 23.271 l.QQ 26.77 j 

^0 AISM 108i CB MET A. 112 46.264 6 .524 21.845 1.00 33.79 c 

AIQM 1M5 H ALA A 120 44.559 1 .875 26.271 1.00 26.64 u 

iiliaCl lOM Ca ftLA A 1?0 45.177 0.712 26.8 64 1.00 29.17 c 

aiQM 1002 £ ALA A 120 46.356 0.308 25.984 1.00 23.21 £ 

aiSd mS O ALA A 120 46.439 0 .759 24.833 1.00 20.19 £ 

^5 1099 Cfi ALA A 120 44.169 -0.419 26.944 l.QQ 26.02 £ 
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ATOM IPgQ H SLV A 121 ^7,238 26,507 1,PP 12t30 M 

ATOM mj CA gLV A 121 48.427 -1.009 25.788 1.00 9.45 £ 

Aiai ism £ gLV A 121 48.070 -1.697 24.450 1.00 11.68 £ 

AIS2M m3 Q gLV A 121 49,928 -1.670 23.450 1.00 14.84 Q 

5 1094 CB GLU A 121 49.321 - 1.883 26.715 1.00 16.74 £ 

AlSaa IQg? TO gLV A 121 50.132 -1.122 27. 763 1.00 18.14 Q 

AlfiM mfi Cfi GLU A 121 49.458 -1 ,000 29.137 1.00 13.00 £ 

AlflM 1997 ggl gLV A 121 48.252 -1.294 29,276 1.00 20.79 O 

AlSatl 1999 QE2 gLV A 121 50.123 -0.521 30,080 l.OO 17.86 Q 

10 AlfiM 1999 N gER A 122 46.887 -2.273 24,409 1.00 11.79 N 

AICM 1100 CA 3ER A 122 46.427 -2.977 23.218 1.00 12.16 Q 

AISM mi C 3ER A 122 46.030 -2.058 22.100 1.00 11.70 C 

AIGM USiZ Q SER A 122 45.717 -2.529 21.010 1.00 13.91 q 

AICM 11Q3 CB SER A 122 45.186 -3.781 23.568 l.QQ 21.50 £ 

15 AlflM llfii 02 5ER A 122 44.143 -2.908 23.976 1.00 28.52 Q 

AXCM HQS M gLV A 12? 46.041 -0.754 22.3 41 1.00 14.65 U 

Ali2M IXM CA gLV A 123 45.783 0.202 21.243 1.00 17.15 C 

A1S21 11P7 g gLV A 123 46.959 0.313 20.240 1.00 11.48 Q 

ATOM Q gLV A 123 46.821 0 .844 19.141 1.00 11.19 Q 

20 iMai lias CB GhV A 123 45.481 1.600 21.805 1.00 21.66 £ 

Aiai nifi eg gLV A 123 44-127 1.694 22.523 1.00 24.68 C 

ATOM 1111 CD GLU A 123 42.984 1.374 21.585 1.00 35.56 Q 

AIS3M 1112 PEl gLV A 123 43,019 1.865 20.426 l.QQ 41.73 Q 

AI2M 1113 gE2 gLV A 123 42.158 0.497 21.940 1.00100,00 Q 

25 hJS^ 1111 N LEV A 121 48.134 -0.185 20,618 1.00 14.02 U 

him 1115 CA LEU A 124 49.296 -0.082 19.740 1.00 15.32 £ 

ATOM 11 16 q LEV A 124 49.062 -0.754 18.458 1.00 17,76 C 

AlCM 1112 Q LEV A 121 48.752 -1.917 18.445 1.00 18.91 O 

him nifi CB LEV A 121 50.564 -0.680 20.362 1.00 18.07 £ 

30 jmai 1112 Cfi LEV A 121 51.922 -0.222 19.803 1.00 21.52 C 

AIQM 122S1 CPl LEV A 121 52.080 1.258 20.117 1.00 20.35 £ 

ATOM 1121 CD2 LEU A 124 53.042 -0.919 20.550 l.QQ 14.07 C 

AX2M 1122 n LEV A 12$ 49.514 -0.071 17.409 1.00 18.44 U 

ATOM 1123 CA MV A 125 49,445 -0.564 16.052 1.00 19.92 C 

35 1124 £ LEV A 12$ 48.034 -0.754 15.509 1.00 25.56 £ 

AlCM 1125 Q LEV A 12$ 47.854 -1.188 14,364 1.00 18.26 O 

AIGtl 1125 CB LEV A 12$ 50.355 - 1.800 15.840 l.QQ 20.79 £ 

ATC^ 1122 eg LEV A 12$ $1.999 -1.511 15.778 1.00 17.21 £ 

hXm 112fi CPl LEV A 12$ 52.744 -2 .649 16.316 1.00 19.95 £ 

40 AJW U29 CPS LEV A 125 $2,3?^ -1.219 14.338 1.00 5.81 £ 

ATGK 1130 M GLN A 12 6 ^7.027 -0.327 16.276 1.00 21.97 M 

AIC3^f 1131 CA gLN A 126 45.652 -0.504 15.790 1.00 19.97 £ 

ATCM 1132 C gLW A 12$ 45.213 0.447 14.724 1.00 28.31 £ 

hlW 1133 Q gLW A 12e 44.076 0.391 14.293 1.00 47.49 Q 

45 ATW 113 4 CB G LN A 12 6 44,652 -0.404 16.911 1.00 19.87 C 
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C 
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1 , 
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N 


25 
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,$?1 


11 


09 


13.22 


C 
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11. 
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1. 


09 


23.44 


C 
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11. 


1$2 


13, 


9?0 
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00 


21.23 


O 
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130 


50.150 


11. 
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15. 


,935 


1. 


00 


18.18 


c 
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11« 


TO 
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130 


59.751 


13. 


37$ 
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,99$ 


1. 


00 


77.44 


c 


30 
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130 
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11, 


,121 


1. 


00100.00 


c 
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1155.. 
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48 .588 


11. 
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00 
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o 
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15. 


1$1 
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,2?5 


Ir 


OP 


21.03 


O 
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1157 . 


N 
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5P,?29 


12. 


21? 


11. 
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1. 


00 


21.35 


N 




ATCM 
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«\ 


PRO A, 
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52,923 


12. 


19? 


19. 


731 


1. 


00 


11.79 


C 


35 
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0 
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131 ,,_ 
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13 . 
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11. 
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1. 


00 


11, ?9 


c 
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54.325 
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19. 
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1, 


00 


20.99 


0 
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CB 
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51,113 


13 t 
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?, 
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1, 


09 


11,7$ 


• c 
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1172 


CG 
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131 


59,971 


13, 


195 


?, 


?1? 


1. 


09 


20.99 


c 
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1173 
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131 


i?.eii 


12, 


62$ 


11. 


017 


1. 


00 


17.25 


c 


40 
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52.?9$ 


11 r 
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12. 
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1 . 
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18.77 
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$9? 
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c 




ATCM 


117$ 


C 
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109 


1. 


00 


21.91 


c 
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1177 
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132 


56.317 


;i. 
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13. 


231 


1, 


09 


24.17 


O 




ATCM 


1178 


CB 


THR A 
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53.71$ 


15. 


?97 


13. 


606 


1. 


00 


23.45 


c 


45 


ATW 


1179 


OGl 


THR A 


132 


52.976 


1$. 


883 


12- 


950 


1. 


00 


31.15 


o 
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ATCM 


U60 


Cg2 


THR A 


132 


54.969 


16 


.519 


14 


.341 


1 


.00 


9. 28 


C 
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29.59 


N 




ATCW 


1192 


CA. 
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26.38 
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C> 
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.994 
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.755 
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.00 


19.67 
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.169 
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12 
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16 


.969 
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.00 


50.55 
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53.354 


13 


.272 


17 
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40.07 
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1. 
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9.98 
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1. 


.00 


9.66 
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12. 


,274 


1, 


rOO 


13.86 
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o 
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AICM 122? ■ Q TLB A 138 5gt3Q7 2t377 14.970 l.QO 13.79 Q 

ATOM 122 6 CB ILE A 138 58.064 4.172 12.316 1.00 17. B5 Q 

Aietl IZ22 QQl ILB A 138 56,680 ±^A22 11-757 i.oo ?fl.?i q 

CS2 TT.K A 138 58.674 2.986 11.602 1.00 £ 

5 ATCM 122? CPl ILE A 139 $$.69$ 3.37$ 11.970 i.oo la.i? q 

AlflM 123Q N ftIA A 139 6Qt213 1.361 14.625 i.oo n.54 m 

Aisai 1221 CA AIA A 139 61.191 3t937 15.288 i.oo 13.22 c 

AlSai 1232 C AIA A 139 61.256 3.364 16.675 1.00 lfl.73 £ 

AieSSl 123? g ALA A 139 61,791 2r319 17.031 i.oo 20.44 q 

10 AISM 1214 « ALA A 139 62,134 $,P73 15.390 i.oo 13.62 £ 

MOM 1225 M Lyg A IIP • 6P.397 1,P33 17.448 i. oo 16.36 M 

AISM IZM CA LYS A 140 60.083 3.600 18 , B15 1.00 15.14 Q 

ATOM 1237 £ LYS A 110 $9.392 2t262 1B.B24 i.oo 15.18 c 

ATOM 122fi Q LY? A 110 $9.921 1,316 19.475 1.00 21.42 q 

15 AlQH 1239 CB LYS A 140 59.19 3 4.606 19.525 1.00 17.86 Q 

AIS3St3 1210 QQ LYS A 119 59.925 5.806 20.152 1.00 21.11 Q 

ATC^ 1241 CD LYS A 14 0 61.208 5.478 20. 958 l.QO 16.75 £ 

Aliai 1242 CE LYS A 110 fiJU.fifi4 6.735 21.B35 1.00 10.06 £ 

Ami 1213 H2 LYS A IIP 62-688 6.496 22.921 1.00 14,40 ^ 

20 AlCM 1211 N ILE A 111 58.356 2.116 1B. 027 1.00 11.49 U 

AlCai 121$ CA A 141 57.703 0.828 17.977 1.00 17.92 £ 

him 124j6 Q ILB A 111 58.729 -0.282 17.577 1.00 13.46 £ 

AISM 1247 O TLE A 141 58.730 -1.374 18.148 1.00 13.92 Q 

AXS3M 1245 CS ILS A 111 S6.197 0-925 17.019 i.oo 22.59 £ 

25 MSai 1219 Wl XLE A 111 55-466 1,906 17.557 1.00 17.61 £ 

ATOM 12$P TO2 ILg A 111 55.863 -0,411 16.700 l.QO 10.49 C 

ATSSM 1250 TOl ILE A 111 54,530 2.327 16.449 1.00 13.43 £ 

AlSai 1252 n ALA A 112 59,637 P,9g9 16.650 1.00 10.29 M 

ATOM 1253 CA ALA A 142 60.657 -0.931 16.228 1.00 7.15 £ 

30 AlfiSM 12$4 C ALA A 112 61,456 -1.301 17.456 1.00 16.58 £ 

ATt^ 12^5 Q ALA A 112 61.839 -2.454 17.621 1.00 13.04 o 

AXCtl 1256 CB ALA A 11? 61r6P1 -0.288 15.130 1.00 4.44 £ 

ATOM 1257 N GLY A 143 61.703 -0 .307 18.316 1.00 9.56 H 

AI2tl 1255 gLY A 113 62.448 -0.525 19.527 1.00 5.15 £ 

35 ATCyt 12$9 C gLX A t13 61.770 -rl.555 2 0.430 1.00 16.36 £ 

AlSai 12&e Q gLY A 113 62,392 -2.482 20. 967 1,00 14.11 Q 

AI2M 12^1 U ILE A 11^ 60.476 -1.418 20.564 1.00 20.33 u 

ATOM 12£2 CA ILE A 111 $9.72$ -2,314 21.407 1.00 15.35 £ 

AIQM 12£3 Q ILE A 111 $9,7Q6 -3.732 20.859 1.00 19.84 £ 

40 KTOK 12 64 O ILE A 144 59.836 -4.700 21.608 1.00 17.93 Q 

AIS2M 1 26^ C S ILE A 111 58.317 -1.819 21. 559 1.00 10.60 £ 

liim 1215 CGI ILE A 111 58.311 -0.610 22.516 l.QO 9.80 £ 

hXm 12 67 CG2 ILE A 111 $7, HQ -2,929 22.122 i.oo 9.60 £ 

AlSai 1268 CDl TLE A 144 57.022 0.076 22.517 1. 00 18.32 £ 

45 ATCM 12 69 N LYS A 145 59.520 -3.841 19,556 1.00 7.20 H 
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ATOM 12211 Cft Lyg h li^ 59.459 -5.139 18. 926 1 - 00 7,64 £ 

aiQM 1221 C LY3 A 1^5 60-840 -5.788 18 .931 1.00 15.32 Q 

ATOM 1222 Q IiYS A. 115 60.923 -6.989 18. 9B1 1.00 14.76 Q 

ATCM 1222^ CB LY5 A 115 58.991 -5.001 17 .516 1.00 11.25 £ 

5 ATOM 1224 CG LYS A 115 57.414 -4.5B1 17. 489 1.00 12.13 £ 

AISZM 1225 CjQ LYS A 115 56.642 -5 .434 18.495 1,00 25.23 £ 

AIS3M 122fi CB LXS A 115 55.189 -4.995 18.6 92 1.00 13.64 £ 

AlSai 1222 UZ IrXg A 115 54.441 -6.111 19 ,392 1.00 11.94 u 

AIQM 122fl n LBV A 11^ 61,934 -5.011 18 .986 1.00 26-9B 

10 aiQM 127? gA LBV A 116 63.261 -5,642 19.167 1.00 19.72 £ 

AieStt 12fiI2 e LEU A 146 63.262 -6.316 20.542 1 ,00 18.20 £ 

Ami 1211 Q LBV A 116 63.590 -7.511 20.703 1.00 19.Rfi q 

KJISSS 1212 CB LBV A llg 64 .398 -4.618 19. 150 1.00 13.5fi £ 

AXCEM 12£3 CG LEV A lie 64.895 -4.258 17. 759 1.00 21.B4 £ 

15 ATOM 1214 CDl LEU A 146 65.672 -2 .945 17.817 1.00 17.94 £ 

ATCM 1215 CD2 LEV A 11^ 65.745 -5.397 17. 102 1.00 16.10 £ 

Aiai U CYS A 147 62.931 -5.523 21.548 1.00 7.91 

AlSai 1212 CA CY? A 117 62 .875 -6.064 22.893 1.00 9.14 £ 

AIQM 1288 £ CYS A 117 62.072 -7.378 22.9 45 1.00 22.72 £ 

20 ATOM 1215 Q CYS A 117 62.568 -8.401 23.383 1-00 16-90 O 

Alffll eg CY? A 117 §2.^222 -5.058 23.809 1.00 12.63 £ 

Aiai 1221 SG CYS A 117 63.411 -3.823 24. 316 1.00 15.02 £ 

A3:ai 1222 U gLV A 119 ^0,923 -7.352 22.508 1,00 20.03 N 

AXCM 1223 CA gLV A 119 60. 016 -8.555 22,567 1.00 16.09 £ 

25 1225 £ gLV A 118 60.685 -9.715 21,802 1.00 22.61 £ 

AlSil 1225 2 gLV A 118 60.651 -10 ,888 22.226 1.00 12.05 Q 

AlfiH 1226 CP gLV A lie 58.597 -8.268 22.046 1.00 14.66 £ 

Aiai 1297 CG GLU A 148 57.864 -7.1B9 22.840 1.00 11.45 £ 

AlfiM 122s CP gLV A 119 56.471 -6 .821 22.277 1.00 11.75 £ 

30 ai£33a 1222 OSl GLV A 118 56.117 -7.055 21.080 1.00 11.65 o 

AlfiM 1110 QB2 gLV A 116 55.728 -6.231 23.081 1.00 22.56 O 

AISM 1301 U SKR A 149 61.368 -9.377 20.715 1.00 15.57 y 

AIQM 1112 CA SER A 11? 61.938 -10.428 19.997 1-00 10.21 £ 

AlSai 1111 £ SBR A 11? 63.040 -11.245 20.502 1.00 15.83 £ 

35 ATW 1304 O ?BR A 119 63.102 -12.458 20.291 1.00 12.72 Q 

Aliai 1305 CB SER A 149 62.270 -9.936 18.488 1.00 9.44 £ 

AIJ3M 13Q$ Qg SER A 119 61.053 -9.650 17.782 1.00 15.91 Q 

AIQM 1307 N TYR A 15Q 63.910 -10.546 21.224 1.00 18.44 u 

AIJ21 1 309 C A TYR A 15P 65.065 -11.100 21.94B 1 . 00 20.50 £ 

40 AICM 1112 C TYR A 150 64.514 -11.848 23.15B 1 . 00 21.87 £ 

Aiar lUQ Q. TYR A 150 64.939 -12.949 23.486 1.00 31.39 O 

AIQtl 1111 Cfi TYR A 1^9 66.005 -9.950 22.425 1.00 13.71 £ 

ATCM 1312 eg TYR A 159 66.994 -9.509 21.365 1.00 14.13 £ 

ATOM nil CPl TYR A 15Q 66.611 -8.673 20.317 1.00 14.64 £ 

45 ATOM 1314 CD2 TYR A 150 68.288 -10.000 21.360 1.00 18.32 £ 
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ATOM 131S CRl TYR A 150 €7,^87 "fi , 3^0 lg.278 1 , QO ll.gJ £ 

ATCT4 1316 CE2 TYR A ISO 69.198 -g.ggg 20,31? I . PP 11. IQ £ 

ATOK 1317 cz TYR A ISO €9,8P1 -6.9PP l?.32e 1 , PP 2P.95 C 

ATM 1318 m TYR A 150 69,73? -9.^85 18.333 — l.PO 27.73 Q 

5 ATOM 1319 N A3N A 151 63.536 -11.249 23.801 1 , PP 11,93 H 

ATCT4 1320 CA A5W A 151 62.gP3 -11.999 24.937 l.PP 23,62 Q 

ATOM 1321 C ASM A 151 62,417 -13.244 24, IIP 1,PP 28.53 £ 

ATCT4 1322 O A3N A 151 62.630 -14.248 25.072 1.00 25.89 O 

ATCM 1323 CB A3N A 151 61.^55 -11.113 25.439 1.00 20.95 £ 

10 ATOM 1324 CG A3N A 151 61.988 -9.867 26.284 ItQO 1?.P7 c 

ATOM 1325 ODi A3N A 151 61.126 -9.P2P 26.466 IrPP 26i72 2 

ATOM 1326 ND2 ASK A 151 63.231 -9 .709 26.700 1-00 6.31 N 

ATOM 1327 N ARG A 152 61.731 -13.249 23.259 1.00 19.91 H 

ATOM 132 8 CA ARG A 152 61.129 -14.465 22.687 l.PP 17.62 Q 

15 ATOK 132 9 c ARG A 152 62.P9P -15,523 22^1fl5 1,PP 21,34 £ 

ATOM 1330 O ARG A 152 61.959 -16.687 22.542 l.PP 15,44 Q 

ATOM 1331 CB ARG A 152 6P,086 -14.149 21.61P IrPP 1^,30 £ 

ATC^ 1332 CG ARG A 152 58.672 -13.754 22,157 l^PO 17.22 £ 

ATOM 1333 CD ARG A 152 57.652 -13.297 21.049 1,PP 9.13 £ 

20 ATOM 1334 NE ARG A 152 57.161 -14.419 2P.241 l.PP 21. P? U 

ATOM 133 5 C2 ARG A 152 57.159 -14,447 lfi.,.212 l.PP 28.61 £ 

ATCT4 133 6 NHl ARG A 152 57.$9P -13,387 1£..2Z1 IrPP 21t98 n 

ATOM 1337 NH2 ARG A 152 56.717 -15.528 l£^i£2 l.PP 26.11 K 

ATOM 133 8 W GLN A 153 63.P99 -15.1P4 21.434 1,0P 16.54 U 

25 ATOM 1339 CA GI^ A 153 64.044 -16,036 2P.942 l.PP SUii £ 

ATOM 1340 c GLN A 153 65tP92 "16,443 21.?p7 l.pp 16r7P £ 

ATOM 1341 O GLN A 153 65,529 -17, $^5 21,763 1,PP 24,35 S 

ATOM 1342 CB GI^ A 153 64.789 -15.372 19.714 l^OO 8,99 £ 

ATOM 134 3 CG GI.N A 153 65.935 -16.225 12^JJ^ 1,00 ±^ C 

30 ATOH 1344 CD GLN A 153 66.315 -15.637 17.762 l.PP 14.17 £ 

ArCM 134 5 DEI GLW A 153 65,61JL -14,763 17.254 l.PP 12.53 Q 

ATm 134 6 ME2 GLN A 153 67,466 -16.P24 17.229 l.PP 13,38 N 

ATOM 1347 H TYR A 154 65.566 -15,518 22 . 6P8 l.PP 14.35 H 

ATOM 1348 CA TYR A 154 66.677 -15.839 23,483 l.PP 12.16 £ 

35 ATOM 1349 c TYR A 154 66.323 -15.93P 24.954 l.PP 19tP6 £ 

ATCT^ 1350 o TYR A 154 67,185 -16.2P7 25.777 1,PP 25.59 2 

ATOM 135 1 CR TYR A 154 67.829 -14,816 23,326 1,PP 16.99 £ 

ATOM 135 2 CG TYR A 154 68.419 -14.733 2JU113 l.PP 17.53 £ 

ATOM 135 3 cm TYR A 154 69.259 -15.726 21.467 l.PP 18,91 £ 

40 ATOM 135 4 CD2 TYR A 154 68,P8P -13.712 21.P91 l.PP 13.97 £ 

ATOM 135 5 cEi TYR A 154 69.782 "15.696 ZiLOSfl l.PP 1Q.98 £ 

ATOK 1356 CE2 TYR A 154 68.621 -13.639 19.BQ6 l.QP 23.81 £ 

ATOM 1357 C2 TYR A 154 69.488 -14.634 19.38P 1,PP 23, P8 £ 

ATOM 1358 OH TYR A 154 70.002 -14.619 18.118 1.00 23.87 Q 

45 ATOti 1359 N GI.Y A 155 65.080 -15.686 25,313 1 ■ OP 12. P9 H 
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ATXM 1405 N SBR A 160 59.689 -2.661 28.859 1.00 24,44 M 

ATOl 140 6 CA 3BR A 160 59.312 -l,3g3 29,209 1 . 00 21t?? Q 

ATCT4 1407 C SBR A 160 58,242 -0.577 28.950 1.00 25.07 Q 

ATGti 1408 o SER A 160 57.2?? -1,127 2g,^S^ 1.00 3.7,02 Q 

5 ATC»1 1409 CB 3ER A 160 58.719 -1 .747 26.79 7 1.00 13,05 _£ 

MCM 1410 QG SBR A 160 59.782 -1.9?7 25,895 1,00 37, ?7 fi 

ATOM 1411 N VAL A 161 58,378 Q,742 28,927 1.00 21.01 

ATOM 1412 CA VAL A 161 57.36? 1>61^ 29,50? 1,00 axifi £ 

ATOM r VAT, A 161 57.068 2.747 28.504 1.00 16,77 £ 

10 ATOM 1414 O VAL A 161 57.955 3.149 27.729 1 . 00 16.33 ^ Q 

ATQ^ 1415 CB VAL A 161 57.806 i^^lfl 30,862 ItOO 17.91 _C 

ATCT4 141 6 CGI VAL A 161 57,873 1,185 31,384 l.QO 16.16 — C 

ATCCM 1417 CG2 VAL A 161 59.137 2.992 30,750 l.QO 21.10 C 

ATOM 1418 N MET A 162 55.794 3,147 28,44? IrQQ 2^,46 U 

15 ATOM 1419 CA MET A 162 55.296 4.185 27.513 1,00 19.23 C 

xrcM 1420 c MET A 162 54.88Q 5.312 28.397 1,00 25,19 £ 

ATQ^ 1421 O MET A 162 53.788 5.269 28,961 1.00 18.35 Q 

ATCM 1422 CB MET A 162 53.979 3.796 26.850 1.00 15.55 C 

ATC»4 1423 CG MET A 162 54,013 2.630 25.949 ItOO 37,79 Q 

20 ATm 1424 SD MET A 162 54.354 3 .100 24.235 1,00 52.07 £ 

ATCT^ 1425 CE MET A 162 56.193 3.134 24.410 1.00 36.30 £ 

ATOM 1426 K PRO A 163 55.730 6.313 2fi.,^21 1-00 18.43 U 

ATOM 1427 CA PRO A 163 55.390 7.472 29.337 1.00 17.76 £ 

AT<^ 1428 C PRO A 163 54.300 8.384 28.667 1,00 21.23 £ 

25 ATOM 1429 O PRO A 163 54.208 8.448 27.433 1.00 15.20 Q 

ATC»4 1430 CB PRO A 163 56.727 8.196 29.423 1,00 11.43 £ 

ATOK 1431 CG PRO A 163 51^^52 7,874 28.031 ItOO 13.99 £ 

ATOM 1432 CD PRO A 163 57,086 6.4P1 27.949 1.00 12,24 £ 

ATOM 1433 N THR A 164 53.478 9 .060 29.479 1.00 13.95 U 

30 ATOM 1434 CA THR A 164 52.581 10.121 28.963 1.00 25.82 £ 

ATOM 1435 C THR A 164 53-406 11,441 28.781 l.QO 19,67 £ 

ATCM 1436 o THR A 164 54.633 11.393 28.869 1.00 13,97 2 

ATOM 1437 CB THR A 164 51,373 10.391 29.903 1.00 25,51 £ 

ATOM 1438 QGl THR A 164 50.470 11.321 29.267 1.00 14,77 Q 

35 ATOM 1439 CG2 THR A 164 51.818 10.886 31.298 1.00 9.06 £ 

ATCT4 1440 N A5N A 165 5 2,75 1 12.599 28.556 1,00 14.99 M 

ATCM 14 4 1 CA ASN A 165 53.448 13.901 28.491 LiJifl 7.63 £ 

ATOM 144 2 c ASN A 165 54.167 14,064 29,824 — l.QO 11,21 £ 

ATOM 144 3 o ASN A 165 53.554 11^225 30,89^ 1.00 17,66 . Q 

40 AroH 1444 CB ASN A 165 52.434 15,061 29.416 LOO H,48 £ 

ATCM 144 S CG A5N A 165 51.492 14.941 27,262 1.00 23.70 £ 

ATOM 144 6 ODl ASN A 165 51.939 14.600 2.6. 1?9 1,00 22-37 Q 

ATOtl 1447 ND2 ASN A 165 50.173 14.925 27.539 1.00 27.22 N 

ATOM 1448 N LEU A 166 55.418 14.490 29.777 1.00 8.23 N 

45 ATOM 1449 CA LEU A 166 56.187 14.604 30,994 1.00 14.40 _£ 
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aiai CA MH A 177 52*224 22 .619 i-oo ,Sfi.4i q 

AlflM liH C A 177 57.810 24.497 22.3 53 1.00 60.91 £ 

AlflM 1542 Q ASH A 177 56.914 25.215 22.fll l l.OO 55.58 Q 

BXai CB AaW A 177 59.619 24 .469 24.065 1.00 50.45 £ 

5 hXm 1544 — eg ASM A 177 59.552 22.970 24 .319 i.oo 66.57 q 

AlflM 1545 PDl A5N A 177 59.095 22.216 23.476 1.00100.00 q 

AI2M 1545 HD2 ASM A 177 60.099 22.546 25 .464 1.00 35.61 U 

AlSai 1547 n SER A 178 57,583 23.387 21.627 1-00 57-10 U 

AlSai 1512 CA SBR A 179 56.234 22.853 21.279 1-00 50.50 £ 

10 AlflM 1549 g SBR A 17g 55,557 22.159 22.491 1.00 76.24 £ 

AlSaX l&5fi 2 SER A 179 54.575 21.400 22. 304 1.00 99-63 Q 

Aiai 1551 Cfi SER A 179 56.316 21 . boo 20.118 1.00 10.17 £ 

AlflM 1552 QQ SER A 178 57.397 22.112 19.217 1.00 71.69 q 

AlfiSM 1553 N HIg A 179 56.134 22.284 23.694 1.00 37.39 n 

15 aiSM 15i4 CA HIg A 179 55,569 21.587 24.855 1.00 30-96 Q 

AlfiM 1555 S Hlg A 179 54.961 22.616 25.7 67 1.00 21 .93 C 

AZflM 155£ Q HI3 A 179 55.641 23.598 26.1 38 1.00 25.17 Q 

AICM 1552 CB HIS A 179 56.634 20.683 25.575 1.00 36.20 c 

AlflM 155£ eg HIg A 179 56.973 19.419 24.835 1-00 42-90 £ 

20 AXSM 1559 WDl HIg A 179 55,973 19.335 23 .457 l.oo 49.52 u 

AXJai IMQ CDZ HIS A 179 57.323 18.190 25- 278 1-00 52-42 C 

AI£3M IS^l CBl HIS A 179 57.283 18-109 23.084 1.00 44-78 £ 

AXJ2M 1552 ??B2 HIS A 179 57.500 17.393 24.168 1.00 50-49 u 

Aiai 1^63 W VAL A 199 53.561 22x4^4 26,036 1,00 19.14 u 

25 AIi2M 1554 <^ VAL A 199 52.886 23.449 26-789 1.00 29.03 £ 

Aiai 1555 C VAL A 199 53.373 23.890 28,142 1.00 31.29 £ 

AlStl 1556 P VAL A 190 53,348 25 .075 28.447 1.00 19.55 Q 

AlfiM 1557 CS VAL A 199 51.403 23.115 26.914 1.00 35.47 £ 

AlfifeJ 15£fi Cgl VAL A 180 50.630 24.399 27.217 1.00 35.84 Q 

30 AIQtl 1569 Cg2 VAL A 199 50.923 22.550 25.663 1.00 36.11 £ 

AI£3M 152^1 n ILE A 181 53.684 22.935 29.005 1.00 26.57 

AlflM 1^21 — ILE A 191 54,139 23.265 30 .360 l.oo 24.49 C 

Aiai 1122 £ ILE A 191 55.371 24.213 30.3 61 1.00 16.51 c 

Aiai 1522 — Q ILE A 191 55.325 25,31^ 30.909 l.oo 24.42 q 

35 Alfitt 1524 Ca ILE A 191 54.285 22.018 31.264 1.00 20.20 £ 

AlJStl 1575 Cgl ILE A 191 52.978 21.428 31r52S 1,09 19.22 C 

AI12t4 1575 Cg2 ILE A 191 55.014 22 .315 32.581 1.00 13.37 £ 

AICM 1577 CPl ILE A 191 52.867 20.086 32.286 1.00 8.03 £ 

A113tl 1525 U PRO A 182 56.452 23,779 29.718 1.00 22.21 u 

40 AI2tl 1522 CA PRO A 18;> 57.664 24,605 29.640 1.00 22.07 £ 

AXSM 15LQ C PRQ A 182 57.379 2 5.852 28.826 1.00 24.18 £ 

AISM 15fil Q PRO A 182 57.811 26.949 29-21Q 1.00 18.35 o 

AlQfcl 1582 £fi PRQ A 182 58.682 23.725 28.890 1.00 24.97 c 

AIQM 1583 OS PRO A 182 57.925 22.473 28.471 1.00 75.77 £ 

45 AlCM 158^ PRO ft 182 56.727 22.359 29.401 1.00 18.23 £ 
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AICM liM — N AIA A 1B3 Sfi^JiZfi 25.707 ?7 .729 i nn ^t.4s u 

hXm 1586 CA AIA A 1B3 5g.261 26.B96 26 .943 i.on Pi.fifi q 

AIS3M 1587 C AIA A 183 55.464 27.900 27.81 1 i.nn ?fi,in £ 

AI£3M IMfi £2 ALA A 183 55.773 2 9.091 27.856 1.00 Q 

5 ATOM 1589 CB AIA A 183 55.473 26.513 25.703 1. 00 13.26 £ 

ATCM li2fl U LEU A 184 54.472 27.389 28.543 l.QQ 23.34 U 

A1S3M 1591 CA LBV A 181 53,^12 28.215 29,401 i.oo 19.05 £ 

AESai 1S22 £ LB? A 181 54,312 28.693 30 ,655 i.oo 21.91 £ 

Ami 1593 9 LBV A 191 54.017 29.771 31,158 I.oo i9.7i q 

10 AISM 1^24 gB LEV A 181 52.309 27.553 29.7 15 1.00 14.41 c 

AlSai LBV A 181 51.342 27.595 28. 525 l.QQ 23,42 £ 

AlSai 1596 CPl LBV A 181 19.918 27,244 28.928 l.QQ 31. p ^^ £ 

AISM 1S22 CP2 LBV A 181 51,380 28 .896 27.690 1 .00 21.7 3 £ 

AlfiM 1598 N LBV A 185 55,116 27.879 31 ,213 i.oo ia.39 h 

15 AICM 1522 CA LEV A 185 55.833 2B .332 32.417 1.00 16.3q £ 

AISM liOfl £ LBV A 185 56,669 29.528 31.9 85 1.00 23.67 £ 

AlSai l^Sn Q LEV A 185 56.681 30.590 32.6 44 1.00 29.3R Q 

AICM l§SiZ CB LEU A 185 56.723 27. 233 33.015 1.00 15.05 £ 

AIQM 1^93 eg LBV A 195 56.021 26 .348 34.041 1.00 15.56 £ 

20 aiCM 16P1 CPl LEV A 185 56.819 25.022 34.301 l.QQ 21.06 £ 

AICM 1^05 CT2 iiBV A 185 55.722 27.113 35.321 1.00 11.02 £ 

AICM lifii M ARI? A 18^ 57.337 29.397 30.852 1.00 17.09 ^ 

AICM l£iO — CA ARC? A 18g 58,137 30.523 30.429 i.oo ib.r? £ 

Alfiaa ifififi £ A 18^ 57,308 31.752 30 .069 1.00 29.00 £ 

25 ^IfiM 18Q9 O AR<^ A 18^ 57.629 32.880 30.476 1.00 23.91 £ 

AIS21 I^IP CB AR<? A 18g 59,026 30.146 2 9,281 1.00 22.06 £ 

AICM Lfell eg ARg A 1B6 59.653 31 .365 28.652 1.00 38.46 C 

AICM l^i2 CP ARg A 186 60,825 31 ,804 29.462 1.00 83.66 £ 

AICM 1613 Ng ARQ A 186 62.012 31,861 28.631 1.00 70.77 U 

30 AISIl 1611 £2 ARg A 186 63.058 32 .622 28.904 1.00 91 .68 £ 

AIGM 1615 ,. WHl ARG A 186 63.Q53 33.366 29 .995 1 .00 56.56 N 

AICM 1616 NH2 ARG A 186 64.098 32 .639 28.082 1 .00100.00 H 

AICM 1617 N ARg A 187 56.234 31,544 29.310 l.OD 20.96 u 

AICM IflS CA ARg A 187 55.361 32 .662 28.941 1.00 19.32 £ 

35 AISM 1619 C ARg A 1B7 54.765 33 .453 30.142 1.00 28.41 £ 

AICM 162P Q ARg A 187 54.823 34 ,700 30.193 1.00 17.23 Q 

AmS 1^21 CS ARg A 187 54.270 32 .223 27.957 1.00 17.05 £ 

AIS21 1622 eg ARg A 187 54.813 31,546 26.720 1.00 61.42 £ 

Aliai 1623 CP ARg A 187 53.696 31.244 25.757 1 .00 44,57 £ 

40 l£2i NE A Rg A 18 7 53.033 32.472 25.354 l.QQ 29.47 n 

AICM 1625 QZ ARg A 187 51.831 32 .534 24.790 l.QQ 17.87 £ 

AICM 1626 NHl ARG A 187 51.136 31 .427 24.544 1.00 24.95 N 

ATPM 1627 HH2 ARg A 197 51.341 33.716 24.447 1 .00 37.77 n 

Ami IfiZS U FHg A 188 51,192 32 .734 31.101 l.DO 23.48 ^ 

45 AICM 1629 CA fHB A 198 53.604 33.399 32.259 1.00 21 .24 c 
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M£3M Ifilfl Q raS A 188 54-638 34.060 33.0 95 1.00 21.39 Q 

MSM IfiH Q PHB A Igg 54.394 35.126 33.fi; >6 1.00 73.90 Q 

aiCM Cfl PHE A 196 52.723 32.466 33-0 77 1.00 19.95 £ 

MJail 1633 eg PHB A IBS 51,389 32.215 32.43 5 1.00 22.28 £ 

5 ATSaM 1631 CDl PHE A 18B 50.440 33.229 32.375 1 .00 19.42 Q 

AIQM 1635 CP2 PHB A 168 51.141 31.03B 31. 734 i.oo 7i fl2 q 

AICM Cai PHE A 18.S 49.191 33.026 31.742 1.00 24.77 Q 

AlSaO 1637 CE2 PHE A 188 49.936 30.826 31.057 1 on 20,17 g 

AlOa 1636 £2 PHE A 188 48.945 31.815 31.068 l.on 2114 £ 

A1S2M 163? N HIS A 18? : 55,831 33.513 33.118 l.on ^ 

AISM 1640 CA HIS A 189 56.93 3 34.122 33.837 1.00 28.79 £ 

AI2il 1641 C HIg A 18? 57.303 35,506 33.315 l.OQ 2fi-5fl q 

AlflM 1M2 — 2 HIS A 18? 57.480 36.463 34.083 I.oo 20.07 q 

AlSai 1M3 CB HIS A 18? 56.148 33 .268 33.641 1.00 31 .38 £ 

15 Aiai 1M4 gg HIS A 18? 59.364 33,844 34.290 1.00 29.98 q 

Aisaa 1645 MDl HIS A 18? 5?. 548 33.833 35 .658 i.oo 3i-00 n 

Aiai 1646 CP2 HIS A 18? 60.44? 34.464 33.766 i.oo 21.79 Q 

AIGM 1647 CEI HIS A 18? 60.722 34.371 35.9 45 1.00 24.04 q 

AlCai IMJi HE2 HIS A 18? 61.257 34.815 34.821 1.00 19.5-t N 

20 AISaM 164? N gLV A 1?Q 57.539 35 .561 32.006 1.00 28.41 ^ 

AXSai 165Q CA glrV A 1?P 57.876 36.816 31.32 4 l.OQ 27.72 £ 

AlSai 1651 C gLV A 1?P 56.725 37 .829 31.437 1.00 32.56 £ 

Ami 1652 Q gliV A 1?C> 56.949 38-995 31.7 17 1.00 27.06 o 

AlSaa 165? CJa GW A 190 58.122 36-529 29.849 l.OQ 28.55 £ 

25 AXaf 1654 eg gLV A 190 59.150 35.461 29.614 1.00 35.29 Q 

AlSai I655 CJ2 gLV A 1?P 60.553 35,941 29.892 1.00 99.81 C 

AIQM 1656 OEl GXU A 190 60.913 36.037 31.085 1.00 86.56 Q 

AICM 1657 0E2 gLP A 1?Q 61.293 36.167 28.910 1. 00100.00 o 

AXSaCl 1658 n ALA A l?.l 55.493 37.391 31.196 1.00 32.67 u 

30 ATfifcl 1659 CA ALA A 1?1 54.349 38.286 31.311 l.OQ 25.30 £ 

AICM IMfi C AIA A 191 54.287 38.795 32.742 1.00 36.20 £ 

AlflM 16.61 Q ALA A 1?1 53.920 39.924 33.014 1.00 27.52 O 

Alfitl 1662 CE ALA A 1?1 53.055 37.563 31.000 1.00 16.48 £ 

Aiai 1663 U. THR A 1?? 54.549 37.927 33.693 1-00 29.39 

35 A1£3M L664 XHR A 1?2 54.395 38,386 35.041 l.OQ 19.08 q 

AISM 1665 e THR A 1?2 55.420 39,494 35.298 1.00 44.78 Q 

AlSai 1666 Q THR A 192 55.094 40.550 35.839 l.OQ 40.58 q 

ATOM 1M7 CB THR A 1?2 54.515 37.235 35.963 l.OQ IB. 99 ^ 

Aiai 1668 ggl THR A 1?2 53.410 36.348 35.755 1.00 34.36 q 

40 Aiaa; 1669 Cg2 THR A 1?2 54.461 37.736 37.425 1.00 21.15 £ 

AX2M ma — U ALA A 1?3 56.617 3?,312 34.757 1.00 48.58 u 

AXQll mj CA ALA A 1?3 57.705 40 .286 34.905 1.00 50.59 q 

Alfiil 1322 C ALA A 1?3 57.496 41 .613 34.145 1.00 54.47 q 

AlflM 1673 Q ALA A 1?3 57.952 42 .698 34.553 1.00 4R.7B Q 

45 ATOK 1674 CB AIA A 193 59.047 39.640 34.496 1.00 51 .7fl q 
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AICM 167? M GLN A 194 56.810 41,530 33.022 1,00 43.16 U 

Alflfcl 167§ Cft gLN A 191 56.566 42,722 32.242 l.QQ 38.03 Q 

ATSai lg77 C gUf A 194 55,264 43.389 32.57 6 1.00 40.85 Q 

AICM Ifiia Q gLW A 191 54,830 44,284 31.845 1.00 51.20 q 

5 ATOM 1679 CB GLN A 194 56.599 42.358 30.750 1,00 35.96 Q 

AlCM leSQ eg GIM A 194 57.910 41.692 30-290 1.00100.00 Q 

AlCtt IfifiO CIJ gUf A 191 57.715 40.661 29. 158 1.00100.00 £ 

AIS3M 1682 Qgl GLN A 194 56.619 40.546 28.579 1 -QQlQO.OO Q 

AISM lee? MB2 girff A 191 58.782 39,904 28.848 1.00100.00 {J 

10 AISM ie?1 W gLY A 195 54.583 42.949 33.63 0 1.00 32.29 U 

ATCM 1685 CA GLY A 195 53.236 43.464 33.864 1.00 36.26 £ 

AlSaM 1^9^ <; gliY A 19$ 52.299 43,332 32-593 1.00 45.33 Q 

ATOM 1687 Q GLY A 195 51,515 44.242 32.346 1.00 45.16 Q 

Ami leeg k gly a i96 52.40 5 42,245 31.788 1.00 36.33 u 

15 AIOM lee? CA gLY A 19$ 51.515 41,965 30.608 1.00 19.06 Q 

Alfit* 1^90 C gLY A 19$ 50,037 41,956 31.11 7 1.00 22.49 £ 

ATfiM leai Q GLY A 196 49.72 4 41.479 32.223 1.00 33.09 Q 

Mai lg92 If PRO A 197 49,144 42.657 30,431 1.00 29.22 ^ 

Aiai ie93 CA PRO A 197 47.790 42 .732 30.953 1.00 25.29 £ 

20 AIQfcl 1691 £ PRO A 197 47.091 41.413 30. 674 1.00 24.64 £ 

Aiai 1695 Q PRO A 197 16.192 40.991 31 .411 i.oo 24.75 q 

AIQM 1696 CB PRQ A 197 47.162 43.911 30.176 1.00 26.31 £ 

AllSM ££ PRQ A 197 48.188 44 .407 29.252 1.00 26.56 £ 

AIQtl 1699 CI2 PRO A 197 49.307 43.454 29.203 1.00 30.25 £ 

25 AIQl 1699 U A?P A 199 47.572 40.723 29.658 1.00 16.88 ^ 

AXai 1709 gA AgP A 199 47.067 39 .418 29.405 1.00 21.65 £ 

AIS2t« 1701 C ASP A 198 48.046 38.522 28.677 1.00 31.28 £ 

AISM 1702 O ASP A 198 49.062 38.978 28.172 1.00 34.57 O 

AIGM 17P3 CB ASP A 198 45.739 39.507 28.669 1.00 32.80 £ 

30 ATOM 1704 CG ASP A 198 45.868 40,055 27,256 l.OQ 46.13 £ 

Alffll 1705 OPl A?P A 199 46.982 40.230 26.725 l.OQ 57.45 Q 

AIQl 17Q6 QP2 A^P A 199 44.817 40 .271 26.640 1 .00 67.61 Q 

AISM 17P7 ti VAL A 199 47.713 37.234 26.614 1.00 38.67 ]S 

Aiai 12fi5 CA VAL A 199 48.499 36 .226 27.901 1.00 27.79 £ 

35 AXOM 17Q9 C VAI^ A 199 47.462 35 ,469 27.065 1.00 25. 8B £ 

AIQM 1710 Q VAL A 199 46.460 35.023 27,598 l.OQ 24.22 Q 

Airai Hi! £B VAL A 19? 19,16? 35,229 28 .905 l.OQ 24.37 £ 

AI521 1112 Cgl VAL A 199 49.874 34,047 28.160 l.OQ 2Q.2fl £ 

AI2M 1212 Cg2 VAL A 199 50.121 35,942 29.835 l.OQ £ 

40 AlfiM 1714 N VAL A 200 47.661 35.386 25.757 1.00 23.72 ^ 

AIS21 1215 — CA VAL A 20Q 16,701 31.691 24.903 i.oo 71.99 £ 

AI5»3 1716 C VAL A 200 47.167 33.286 24.499 1.00 7?.fl5 £ 

Ami 1212 — Q VAL A 200 19,321 33,109 24.188 i.qq 77 q 

AlSai 1719 .CB VAL A 200 46.358 35.548 23.680 T.QQ 7-^.11 £ 

45 Ami 1719 CGI VAL A 200 45.561 34.737 22.598 1.00 lfi ?S £ 
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ATOH 1720 CG7 VAT. A 300 45.65^ 36.623 24.130 1.00 27.86 Q 

ATOM 1721 N VAT A 201 46.296 32.278 21. «2 l.QQ 27.3? H 

ATCT4 1722 CA VAL A 201 46-588 30.893 Z^LJi£^ 1.09 9.63 C 

Aim 1723 C VAL A 201 15.653 30.52? 23.165 3.. 09 19,63 C 

5 AlflM mi Q VAL A 201 11.452 30.755 23.312 1,00 17.63 Q 

ATCT4 172S CB VAL A 201 46.306 29.952 25.426 1.00 19.95 £ 

AlfiM 1225 Cgl VAL A 201 46.703 28.519 25.054 1.00 20.85 £ 

A1S3M 1727 Cg2 VAL A 201 ^7.0B6 30.439 26.661 1.00 16.73 £ 

AlflM 12Za M TRg A 202 46.210 30.080 22.030 1.00 14.36 ^ 

10 ATOM 172 9 CA TRP A 202 45.422 29.693 20.9^5 1.00 18,97 Q 

ATCM 1730 S TRg A 202 11.195 29t572 21.313 ItOP 36,22 £ 

him mi Q TRP A 202 44.934 27.694 22.057 1.00 31.46 Q 

ATOM 1732 CB TRP A 202 46.292 29.055 19.823 1.00 19.14 £ 

hjm 1733 CG TRP A 202 17.213 29.891 19.066 1.00 33.65 £ 

15 mil CPl TRP A 202 48.391 29.463 16.129 1,00 35.29 £ 

AISM 1735 CP2 TRP A 202 17,126 31>282 19.772 1.00 39,90 £ 

ATOM 173 6 NEl TRP A 202 48.941 30,181 17,693 1.00 37.86 U 

ATOM 1737 Cg2 TRP A 202 48.228 31.624 17.922 1.00 38.35 £ 

AISM 1 73 9 £B3 TRP A 202 16- 2Q6 21^2^ 19,139 1,90 39,39 £ 

20 ATOM 1739 C22 TRP A 202 48.380 32.884 12^231 ;i.QO 36,15 £ 

ATOM 174 0 CZ3 TRP A 202 16.356 33.512 19,579 ItOO 39,60 £ 

ATfiM mi CH2 TRP A 202 47.428 33.828 17,691 1>00 40.99 £ 

AlflM 1712 H gLY A 203 13.215 29,561 20.912 1.90 25,59 U 

ATCf i 17 13 CA gLY A 203 12.332 27.193 21.169 ItOQ 13,99 £ 

25 ATOM 1744 £ gLY A 293 11.260 27.813 22.193 1.00 21.12 £ 

ATOM 1745 o GLY A 203 11,319 29,915 22.996 1,90 22.96 Q 

ATOM 174 6 N 3ER A 204 40.270 26 .919 22.262 1.00 16.88 H 

ATOM 1747 CA 3ER A 204 39.163 26.979 23.192 1,90 18. 3^ £ 

ATOM 174 8 C SER A 204 39.561 26.664 24.659 1.00 22.07 £ 

30 ATOM 1749 o SER A 204 38.999 27.096 25.601 1,00 31,39 Q 

ATOM 1750 CB SER A 204 38.053 25.998 22.740 1,90 9,99 Q 

ATCM 1751 OG SER A 204 38.237 24.695 23.291 1.00 16.37 O 

AlfiM 1252 ^ gLY A 295 1 Qt5 62 25.813 24.854 1,90 12t12 M 

ATCM 175 3 CA GLY A 205 40.963 25.411 26.208 1.00 11.64 £ 

35 ATCM 175 4 c GLY A 205 10,209 21,179 26.711 ItOQ 19f19 £ 

ATCM 175 5 Q GLY A 205 ^SLAZZ 23.723 27.838 1.90 13,59 2 

ATOM 175 6 W THR A 206 39.292 23.683 25.881 1.00 15.38 M 

ATOM 1757 CA THR A 206 38 .432 22.594 26.281 1.00 10.80 £ 

ATCTi 1758 C THR A 206 39.056 21.221 26,151 ItOQ 26.39 £ 

40 ATOM 1759 o THR A 206 39.561 20.267 26.737 1,90 23.28 Q 

ATOM 176 0 CB THR A 206 37,121 22,562 25.160 1,90 12.96 £ 

ATC^ 1761 OGI THR A 206 37.138 22,395 21.992 1.90 13,12 Q 

ATOM 1762 CG2 THR A 206 36.348 23.840 25.629 1.90 10.62 £ 

ATGH 17 63 N PRO A 207 40.101 21 .083 25.354 1,90 21.10 U 

45 ATOM 17 64 CA PRO A 207 40.65B 19.743 25,17.5 1 ■ OQ 19,15 £ 
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ATW 17^? C PRO A 297 11,31^ 26,123 l.QQ 21.7? C 

ATOM 1766 o PRO A 207 11.??1 — 1?.?25 27.215 1 ■ PQ 20.65 Q 

AT04 1767 CB PRO A 207 11.638 19.903 21.P13 l.QQ 17,53 C 

ATOM 176 8 CG PRO A 207 11,116 21.213 23.307 1.00 21.15 C 

5 XTCM 1769 CD PRO A 207 40.698 21,131 1,00 23.11 C 

ATCM 1770 N MET A 208 41.112 17.876 26.624 1 - DO 15.60 U 

MOM 1771 CA MET A 208 41,694 17t167 27.775 1 .00 22.91 £ 

ATC»< 1772 C MET A 208 43,058 16,427 27,579 1.00 21.90 £ 

Ami 1773. 0 MET A 208 13.219 15,^77 26.63? 1.00 23.16 Q 

10 ATCT^ 1774 CB MET A 208 40,645 16.273 28.386 1,00 32,86 £ 

ATOM 1775 CG MET A 208 ' 39.630 17.057 29,22? 1,00 16.17 £ 

ATOM 1776 3D MET A 208 38.301 15.990 29.826 1.00 57.85 S 

ATOM 177 7 CE MET A 208 37.999 15.028 28.313 1.00 58.23 C 

ATOM 1778 N ARG A 209 11.022 :t6. 681 28.156 1.00 17,75 U 

15 ATOM 1779 CA ARG A 209 45.318 16.042 28.324 1.00 19.88 £ 

ATOM 1780 C ARG A 209 45.871 15,534 29,639 1,00 16,92 £ 

ATOK 178 1 O ARG A 209 45.433 15.946 30.697 l.QO 16.58 £ 

ATC»< 178 2 CB ARG A 209 16.310 16.963 27.658 1.00 21.07 £ 

ATOM 1763 CG ARG A 209 45.980 17.178 26.275 1.00 22,57 £ 

20 AroK 178 4 CD ARG A 209 15-83? 16-357 25.282 1.00 28,26 £ 

ATOM 178 5 NE ARG A 209 15.586 16.819 23.906 1.00 23,15 U 

AICM 1786 cz ARG A 209 11.120 16.712 23.267 1.00 31,52 £ 

Aim 1787 UHl ARg A 209 13,336 16.267 23.890 1.00 18.0? K 

ATOM 1788 KH2 ARG A 209 44.339 17. 175 22.012 1,00 29,78 U 

25 ATCM 1 7B9 U gLV A 210 16, 878 11.675 29.517 1.00 20,87 u 

ATOM 1790 CA GLU A 210 17,530 14.079 30,720 IrOO 17, ?7 C 

ATW 1791 C GLU A 210 49.031 14.490 30.851 1.00 20.96 C 

ATOM 1792 O GLU A 210 19^718 11.622 29,811 1.00 22,41 O 

ATOM 1793 CB GLU A 210 17,100 12,562 30,571 1.00 16-26 £ 

30 ATOM 1794 CG GLU A 210 47.807 11.7B5 31.809 1.00 19.91 £ 

ATOM 1795 CD GLU A 210 18,057 10.301 31.531 1.00 27.81 £ 

ATCM 1796 OEl GLU A 210 l^^JULl 9.919 30.313 l.QO 17,29 Q 

ATOM 1797 OE2 GLU A 210 18,268 9.510 32.191 1,00 21,63 o 

hJW 1 7 98 W PHE A 2i; 19,501 11.712 32.081 1.00 11,02 U 

35 fi^M 1799 CA PHE A 211 50,89? 15.159 32.?53 1.00 31.7,18 £ 

ATOM IBOO C PHE A 211 51.458 14.414 33.531 1.00 33.62 £ 

ATOM 1801 o PHE A 211 50,71g 11.031 31.113 1,00 27,96 S 

ATOM 18 0 2 CB PHE A 211 50.933 16.677 32,611 1,00 17.78 £ 

ATCM 18 0 3 CG PHE A 211 50.303 17,190 31.511 1.00 21.19 £ 

40 ATCTi 1804 cm PHE A 211 51.009 17.676 30,320 1,00 17.36 £ 

ATOK 180 5 CD2 PHE A 211 18,933 17.811 aioJ&lfi 1.00 15,09 £ 

ATOM 1806 CEi PHE A 211 50.399 18.331 29.237 1,00 16,37 £ 

ATC»< 1807 CE2 PHE A 211 48.288 18.491 30.533 l.OQ £ 

ATOM 180 8 PHE A 211 19,053 18,756 29,311 1.00 12,73. £ 

45 ATCM 180 9 W LRU A 212 52.761 14.161 33.195 1,00 23.76 U 
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AlCM mfl CX LEU A 212 53.405 13,448 34.603 1.00 21.74 £ 

iOSSi Ull C liEV h 212 54.7-72 14 .053 34.898 1.00 14.00 Q 

Ami m2 Q iiEy a 212 55.519 i4.39b 33.985 1.00 13.99 q 

AICM U12 CB LEV A 212 53.548 11.954 34.294 1.00 21.52 Q 

5 AlflM Cfi LBU A 212 54.033 11.039 35.4 06 l.QQ 21.09 £ 

AlflM mS CDl LEV A 212 52.866 10.634 36. 280 1.00 20.84 Q 

AlflM IfiUS CD2 LEU A 212 54.768 9.829 34.832 1-00 13.18 C 

AlCM im M HIS A 213 55.023 14.302 36.1 75 1.00 9.60 H 

Alffll £A HIS A 213 56.290 14 .B64 36.555 1.00 13.66 £ 

10 ATCM m? £ HI5 A 213 57.380 13.828 36.293 1.00 20.37 Q 

AlCM Q HIS A 213 • 57.239 12.JUJ 36.542 l.oo 16.08 Q 

ATOM 1821 Cfi HIS A 213 56.280 15.250 38.002 1-00 18.72 £ 

AlfiM U22 W HIS A 213 57.491 16 .017 38.408 1.00 21.22 Q 

AlfiM U22 NPl HIS A 213 58.703 15 .406 38 .656 1.00 24.29 g 

15 ATCM 1824 CD2 HIS A 213 57.716 17.353 38.499 1.00 23.67 Q 

Alffli CSl HIS A 213 59.615 16.331 38.9 17 1.00 19.13 £ 

AISM 1926 HE2 HIS A 213 59.041 17.523 38.8 47 1.00 21.99 N 

Alffll 1927 N VMi A 214 58.459 14 ,295 35.698 1-00 21.07 U 

AICM lfiZ5 CA VAI/ A 214 59.532 13,383 35,3^1 l.QP lg,23 C 

20 AlCH C VAL A 214 60,067 12.523 36.551 1.00 27.20 Q 

AlflM Iflifi Q VAI. A 214 gQ,6Q4 11.444 36.359 1.00 22.23 Q 

Alflaa im CB YAL A 214 60.625 14 .125 34.566 1.00 11. B4 £ 

Aiai 1832 CGI VAL A 214 61.390 15 .199 35.485 1-00 6.52 £ 

ATOM 1933 TO2 VAL A 214 61.560 13.097 33.902 1.00 12.39 £ 

25 hlSM mi M AffP A 215 59.893 12 .984 37.790 1.00 25.29 U 

ATgW 1935 £A ASP A 215 60.406 12.228 38.936 1.00 18.19 £ 

AX2M 183^ C ASP A 215 59.53P lljJlZl 39.230 i. OO 13.85 c 

AISM 1837 Q A?P A 23,5 59.988 9.981 39.666 1.00 17.44 O 

AlfiM 1839 £B ASP A 215 60.575 13.129 40.155 1.00 16.27 C 

30 ATPM 193? £S ASP A 215 61,859 13.979 40.068 l.QQ 30.73 £ 

AISH 1840 QPl ASP A 215 62,762 13.614 39.308 1.00 23.02 O 

AlSai 1841 0P2 A3P A 215 61.957 15.029 40.730 1.00 26.00 Q 

AIQSi lfi42 U ASP A 216 58.276 11-136 38.863 1.00 20.08 ^ 

hXm 1843. CA ASP A 216 57.378 10.017 39.016 1 .00 18.78 £ 

35 ATOM IMi £ ASP A 21$ 57.761 9 .083 37.894 1.00 23.56 £ 

1115 Q ASP A 21$ 57.715 7.880 36.026 1.00 20.79 Q 

A12M 194$ CB ASP A 21$ 55.912 10.457 36.821 1.00 17.18 £ 

Alfifcl 1M2 CG A gl6 55.193 10,757 40,1 62 1.00 38.03 £ 

ATOM 1948 PPl ASP A 21$ 55.503 10.119 41.223 1.00 26.02 O 

40 ATSai 1849 PP2 ASP A 216 54.249 11.587 40.124 1.00 25.41 Q 

AlCSkl 1850 n MET A 217 58.092 9.653 36.755 1.00 18.11 U 

AlSai 1651 CA MET A 217 58.394 8.785 35.636 1.00 22.41 £ 

AISM £ MET A 217 59.572 7.942 35.992 1.00 27.54 £ 

ATCM 1853 O MET A 217 59.579 6.752 35.710 1.00 20.86 £ 

45 him 1854 CB MET A 217 58.637 9.592 34.345 1-00 21.24 £ 
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AXCM 1955 CQ MET A 217 59,47B 8.91fl 33.2fl7 1.00 16.37 Q 

AlCM 1Q5^ ?p MET A 217 58.962 7,412 32.473 1.00 30.51 a 

aiCai 1B57 CB MET A 217 57.465 7.608 32-391 1.00 19.57 Q 

AliSH ma H ALA A 218 60.561 B.S62 36.623 1.00 19-09 U 

5 AIC83 1859 CA ALA A 219 gl.771 7.841 37. 002 l.oo 13.65 £ 

ATCM lfl£fl £ ALA A 219 61.436 6.778 3fi-028 1.00 22-61 £ 

AlSai IMl Q ALA A 219 61.934 5 .670 37.967 1.00 19.36 Q 

AlSai 1992 gB ALA A 219 62.99? 8.780 37. 579 l.oo 12.23 q 

Aiai 1M3 n ALA A 219 60.605 7.109 39.000 1.00 19-34 U 

10 1991 gA ALA A 219 60.31P 6.105 40.023 l.oo la.oi q 

AISM 1M5 £ AIA A 219 59. 630 4.901 39.413 1.00 23.57 £ 

AlCM 1996 <? ALA A 219 59.781 3.777 39.8 98 1.00 22-71 Q 

ATCM 1967 gp ALA A 219 59.387 6,678 41,083 1.00 10.11 q 

Aig« 1869 N ALA A 23Q . 58,753 5.174 38.454 1.00 18.99 U 

15 AICM 1999 CA ALA A 229 57.905 4.158 37.855 1.00 14.12 c 

AiaS lllfl 2 ALA A 229 58.753 3,213 37.034 1.00 25.33 £ 

Aiai laii Q ALA A 22Q 58.584 2,006 37.114 1.00 20.63 Q 

Aiai 1972 CB ALA A 229 56.796 4.798 37.023 1.00 8.53 £ 

Ami lfl23 U SER A 221 59.770 3.772 36.379 1.00 23.92 

20 Aiai 1871 CA SER A 221 60.702 3.011 35.556 1.00 18.38 £ 

AI£ai laiS £ SER A 221 61.537 1.989 36.353 1.00 20.90 £ 

AISM 1976 O gSR A 221 61.683 0 .799 35.963 l.OQ 19.84 Q 

AlflM IfilT £B SER A 221 61.604 3.985 34.804 1.00 10.67 £ 

Aiai Ifllfi Q£ SER A 221 60.847 4.744 33.8 67 1.00 15.61 Q 

25 AUai 1979 H IL£ A 222 62.083 2.476 37.463 1.00 18.12 U 

AXai IBM XLfi A 222 62.866 1 ,644 38.381 1.00 21.56 £ 

Ami 1991 C ILE A 222 62.020 0.554 39.0 68 1.00 29.10 C 

ATW 1982 O ILE A 222 62,504 -0.566 39.307 1,00 19,03 Q 

Aliai 1883 CB ILE A 222 63.467 2.516 39.432 1.00 24.56 £ 

30 AlStl 1891 Cgl ILE A 222 64,465 3.473 38.765 1.00 32.13 C 

ATCM 1M5 QQZ ILE A 222 64.129 1.671 40.500 1.00 28.26 £ 

Ami lfll£ CDl ILE A 2:^2 64.973 4.585 39.6 49 1.00 15.61 £ 

AIQJ3 1997 N HIS A 223 60.772 0.907 39.384 1.00 19.34 u 

AlSai 1999 HIS A 223 59.829 -0.031 39.996 1.00 20-46 £ 

35 AlSai 1999 C HI? A 223 59,599 -1.097 38.964 1.00 24.82 £ 

Ami 1999 O HIS A 2?3 59.723 -2,283 39.27 0 1.00 24,66 O 

Ami 1991 CB HIS A 223 59.195 0,637 40.359 l.oo 19.53 £ 

Ami m2 CG HIS A 223 57.373 -0.333 40.759 1.00 28.64 £ 

AlSai 1993 NPl HIS A 223 57.021 -0.564 42.082 l.OQ 24.16 ^ 

40 aifiSM 1991 CP2 HIS A 223 56.497 -1.062 40.004 l.OQ 30.39 £ 

AXOil 1995 CEl HIS A 223 55.983 -1,399 42.112 1.00 30.39 £ 

AlSai 1896 NE2 HIS A 223 55.652 -1.727 40.8 69 1.00 28.13 M 

AlCai 1897 N VAL A 221 59.354 -0 ,684 37.725 1.00 22. OQ n 

Ami 1999 VAL A 224 59.111 -1.657 36.652 1.00 19.15 £ 

45 AtSM 1999 <; VAL A 221 60.350 -2.490 36,333 i-QQ 25.89 £ 



wo 99/64618 



PCT/US99/11576 



124 





ATOM 


1900 


0 


VAL A 224 


$0,282 


-3 


.709 


36 


.250 


1 


.00 


22 


-37 


o 




ATOl 


1901 


CB 


VAL A 224 


58.559 


-1 


.022 


35 


.377 


1 


.00 


22 


.59 


0 




Aiai- 


1?02 


C61 


VAIi A 224 


SB. 512 


-2 


.050 


34 




1 


.00 


22 


.61 


0 




ATOM 


1^02 


CG2 


YAL A 224 


57.161 


-0 


.491 


35 


.650 


1 


.00 


23 


.44 


c 


5 


hJM 


1904 


K . 


HBJ A 225 


61.499 


rl 


.939 


36 


.255 


,1 


.00 




.93 


W 




ATOM 


1905 


CA 


MET A 225 


62.710 


-2 


.577 


36 


.004 


1 


.00 


23 


.69 


c 




ATOM 


1906 


C 


MET A 225 


62.996 


-3 


.679 


37 


.071 


1 


,00 


31 


.95 


0 




ATC3M 


1907 


..0 


MET A 225 


63.290 


-4 


.905 


36 


.795 


1 


.00 


24 


,33 


o 




ATOM 


1908 


,..CB., 


MET A 225 


63.902 


-1 


.604 


36 


.056 


1 


,00 


21 


,34 


0 


10 


ATCM 


190?,. 


.CS 


MET A 225 


65.295 


-2 


.296 


35 


.999 


1 


tOO 


17 


.83 


c 




ATOM 


1910 


SV 


MKT A 225 


65,750 


-2 


,959 


34.306 


1 


,00 


23 


.33 


? 




AT«t 


1911 


OS 


MET A 225 


67.090 


-1 


.996 


33 


.785 


1 


.00 


16 


.46 


c 




ATCM 


1912 


N 


GLU A 226 


62.644 


-3 


.319 


38.316 


1.00 


19 


.54 


N 




ATCM 


1913 


CA 


GLU A 226 


62.988 


-4 


.161 


39 


.429 


1 


rOO 


21.58 


c 


15 


ATOM 


1914.- 


0 


GLU A 226 


61.999 


-5 


.200 


39, 


.919 


1, 


.00 


30 


.77 


0 




ATCM 


1915 , 


0 


GLU A 226 


62.309 


-6, 


.012 


40, 


.790 


1, 


.00 


29 


.39 


o 




ATCM 


1916 


OB 


GLU A 226 


63.613 


-3. 


.323 


40, 


.547 


11 


,00 


20 


.47 


c 




ATCM 


1917.. 


0<? 


GLU A 226 


64.937 


-2. 


.673 


40. 


.122 


1, 


.00 


23 


.03 


c 




ATOM 


1918 


OP 


GLU A 226 


65.504 


-1. 


.909 


41, 


,209 


1, 


rOO 


32 


,62 


0 


20 


ATOM 


1919 


0£1 


QJiV A 226 


64.721 


-1, 


,455 


42, 


.122 


1, 


.00 


26. 


.12 


0 




ATCM 


1920 


0£2 


GLU A 226 


66,711 


-1, 


,479 


41, 


,152 


l1 


rOO 


17, 


,67 


o 




ATOM 


1921 


If 


LEU A 227 


60.937 


-5. 


r249 


39, 


.295 




.00 


34. 


,11 


N 




ATOM 


1922 


OA 


LEU A 22 7 


59.883 


-6i 


r296 


39, 


,642 


1.00 


35, 


r26 


c 




ATOM 


1923 


0 


LEU A 227 


60.537 


-7. 


644 


39. 


,320 


1. 


.00 


27, 


,91 


0 


25 


ATCM 


1924 


0 


LEU A 227 


61.291 


-7. 


766 


38.340 


1.00 


1?, 


,89 


o 




ATCM 


1925 


0& 


LEU A 227 


58.693 


-6. 


236 


39. 


679 


IrOO 


36, 


49 


c 




ATCW 


192 6 




LEU A 227 


57.381 


-5, 


569 


39. 


955 


1.00 


40. 


30 


0 




ATC»^ 


1927 


OPl 


LEU A 227 


57.697 


-4. 


194 


39, 


392 


1, 


op 


42, 


,04 


c 






1929 


0P2 


LEU A 22 7 


56.610 


-5. 


577 


37. 


647 


If 


00 


46. 


21 


c 


30 


ATCM 


1929 


N 


ALA A 228 


60.026 


-9r 


699 


39. 


955 


1. 


00 


27. 


15 


N 




ATCW 


1930 


OA 


ALA A 228 


60.425 


-10. 


051 


39, 


616 


1, 


00 


25, 


26 


, 0 




ATOM 


1931 


0 


ALA A 228 


59.901 


-10. 


435 


39. 


279 


1. 


00 


27. 


93 


c 




ATOM 


193? 


0 


ALA A 228 


59.624 


-lOr 


093 


37, 


934 


1 . 


00 


31. 


26 


o 




ATOM 


193? 


OS 


ALA A 228 


60.003 


-11. 


052 


40, 


703 


1 .00 


22. 


05 


0 


35 


ATCM 


1934 


N 


HIS A 229 


60.624 


-11. 


160 


37, 


539 


1. 


00 


27.05 


N 




AT»! 


1935 


OA 


HIS A 229 


60.275 


-11. 


605 


36. 


222 


1. 


00 


24, 


42 


c 




ATCM 


1936 


c 


HIS A 229 


59.905 


-12, 


260 


36. 


184 


It 


00 


21. 


74 


c 




ATOM 


1937 


0 


HIS A 229 


58.015 


-11.851 


35.398 


1, 


00 


22. 


22 


p 




KTOK 


1939 


OB 


HIS A 229 


61,351 


-12. 


520 


35. 


699 


1. 


00 


17. 


71 


c 


40 


ATOM 


1939 


00 


HIS A 229 


61.294 


-12. 


701 


34, 


220 


1. 


00 


27. 


24 


0 




ATCM 


1940 


NPl 


HIS A 229 


61.060.. 


-11. 


650 


33. 


350 


1. 


00 


34, 


38 


N 




ATCM 


1?41 


0P2 


HIS A 229 


61.292 


-13. 


921 


33. 


465 


1- 


oo 


31, 


45 


0 




ATC»4 


1942 


C£l 


HIS A 229 


60.992 


-12, 


113 


32. 


115 


1. 


00 


30. 


50 


C 






1943 


NS2 


HIS A 229 


61.124, 


-13, 


427 


32. 


159 


It 


00 


35, 


23 


N 


45 


ATOM 


1944 


N 


GLU A 230 


59,691 


-13. 


161 


37. 


140 


1, 


00 


20. 


24 


N 



wo 99/64618 



PCT/US99/11576 



125 





ATOM 


1945 


CA 


GLU A 230 


57 


.425 


-13.995. 


37.209 


I. 00 29,41 


C 




AXCK 


1946. 




GXtV A 230 


56 


.191 


-13.051 


37.341 


1.00 22.20 


C 




ATOM, 


1947 


0 


GLU A 230 


55 


^159 


-13.359 


36.679 


. IrOO 17,79 


Q 




Arm 


194B 


CB 


GLU A 230 


57 


.464 


-14.99,7. 


39.274 


1.00 38.51 


c 


5 




1?4? 


re 




A 230 


59 


.095 


-14.582 


39.567 


1.00 63,09 


C 




ATOM 


1950 


CI? 


GLU A 230 


57 


.036 


-14,473 


40.661 


1.00100.00 


C 




ATCM 


1951 




GLU A 230 


55 


.859 


-14,972. 


40.400 


1.00100.00 


o 




ATCM 


1952 






A 230 


57.409 


-14.003 


41,769 


l.QO 91.49 


Q 






1953 


N 


YAIi 


A 231 


56 


.272 


-12.004 


39.192 


l.oo 16.53 


N 


10 


ATCM 


1954 


CA 




A 231 


55.202 


-11.029 


39.356.. 


1.00 20.23 


C 




AT«1 


1955 


Q 


VAL A 231 


55 


.009 


-10,164 


37.102 


1.00 24.45 


c 




AXfiU— 


1956 


Q 


VAL 


A 231 


53.B64 


-9.834 


36.705 


1.00 21.00 


Q 




ATCM 


1957 


CB 


YAL A 231 


55.541 


-10.057 


39.42€ 


1.00 28.61 


c 




ATCM 


1959 


CGI 


VAIi 


A 231 


54 


.362 


-9.099 


39.61P 


1.00 29.79 


C 


15 


ATCM 


19,59 


QQ2 


.-YAL 


A 231 


55 


.881 


-10.757 


40.677 


l-,00 29,96 


c 




ATCM 


I960 , 


N 


TRP 


A 232 


56 


.133 


-9.798 


36.486 


1.00 17-17 


N 




ATOM 


1961 


CA 


TRP. 


A 232 


56 


.052 


-9.044 


35.262 


1.00 21.52 


c 




ATCM 


1962 


C 


TRP 


A 232 . 


55 


.399 


-9.844 


34.156 


1.00 20.53 


c 




ATOM 


1963 


P 


TRP. 


A 232 


54 


.599 


-9.306 


33.390 


1.00 24.31 


Q 


20 


ATOM 


1964 


CB 




A 232 


57, 


.439 


-9.644 


34,901.. 


1.00 29.88 


C 




ATOM 


1965 




TRP 


A 232 




r430 


-7,943... 


33.500 


1.00 27.65 


c 




ATCM 


1966 


GDI 


TRP 


A 232 


57, 


.194 


-6.464 


33.356 


1.00 25.42 


C 




ATCM 


1967 


C2?2 


TRP 


A 232 


57. 


.714 


-9.336 


32.169 


1.00 27.75 


c 




ATCW 


196B 


NEl 


TRP 


A 232 


57. 


,325 


-6.095 


32.033 


1,00 22.53 


N 


25 


ATCM 


1969 


CE2 


TRP 


A 232 


57. 


655 


-7.203 


31.279 


1.00 25.11 


C 




ATCM 


1970 


CB3 


TRP 


A 232 


59. 


Q37 


-9.603 


31.640 


1.00 22.72 


c 




ATOM 


1971 


CZ2 


TRP 


A 232 


57. 


917 


-7.316 


29,979 


1.00 17.23 


c 




ATCM 


1972 


CZ3 


TRP 


A 232 


59. 


239 


-9.720 


30.223.. 


1.00 25.97 


c 




ATOM 


1973 . 


CH2 


TRP 


A 232 


59. 


154 


-9-591 


29.368 


1.00 22.07 


c 


30 


ATCM 


1974 ,. 


N 


Ii£U 


A 233 


55. 


749 


-11.121 


34,018 


1.00 23.80 


N 




ATCM 


1975 


CA 
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53. 


652 


-12,119 


33.122 


1.00 24.51 


c 




AT<?M 


1977 


9 
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CB 
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C9 
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CA 
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929 


-12.523 
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C 
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c 




ATOM 


1985 


Q 


GLU 


■A 234 ... 
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N 
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5i.ei2 


-6-315 


26 


.662 


1 


,00 


21 


.20 


c 




ATOM 


2040 


c 
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54.947 
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.02 


o 




ATCM 


2012 . 


CB 


SER A 241 
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25 
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51. §§1 
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.87 
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CA 
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-2.548 


27 
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52-591 
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23 
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ATCM 


2051 
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A 242 


53.1^5 
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13 
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ATOM 


2052 
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A 242 


51.629 


-0.887 
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1, 
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17 


.44 
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ATOM 


2053 
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A 242 
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-2.031 
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1, 
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20 


ATOM 


2051 


N 


ILE 


A 243 


54.310 


-1.203 


29. 
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1, 


,00 
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ATOt^ 


2055 


CA 
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A 243 


53.19? 


-0,009 


.30, 
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1. 


,00 
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ATOM 


2056 


Q 


IL& 


A 243 


53.336 
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30, 


.191 


1, 
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.23, 
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o 
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51.312 
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30. 
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1. 


rOO 
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2058 


C& 
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A 243 


54.166 


-1.273 
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1- 
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21, 


.62 


c 


25 


ATCM 


2059 


C<?2 


ILS 


A 243 


51.011 


-2.793 


31. 


.576 


11 


.00 




60 
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ATOM 


?060 




ILfi 


A 243 


53.497 


-0.665 


32.735 


1. 


00 


17. 


,37 


c 




ATOM 


2061 


CDl 


IL& 


A 243 


51.725 


-3.365 


32. 


711 


1, 


00 


11. 


,92 
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ATCM 


20^2 


N 


-ASIL 


A 244 


52.112 


1.217 


30. 


013 


1. 


00 


16. 


,43 
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ATOM 


2063 . 


CA 


A5N 


A 244 


51.824 


2.669 


30. 
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1. 


00 


19. 


,99 


c 


30 


ATCM 


20^1 


c 


A9N 


A 244 


52.252 


3.292 


31. 


316 


1- 


09 


19. 
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ATOM 


2065 


O 


ASK 


A 244 


51.965 


2.727 


32. 


105 


1. 


00 


19. 


58 
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ATOM 


2066 


CB 


ASN 


A 244 


50.304 


2.987 


29. 


910 


1, 


00 


15. 


67 


c 




ATCM 


2067 


C<? 


ASN A 244 


19.769 


2-702 


.29. 
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1. 


00 
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ATCM 


2068 
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ASN 


A 244 


50.546 


2.583 
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64 


o 
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ATC»< 


2069 
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A 244 


18,113 
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29. 
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1, 


00 


10. 


16 
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ATW 


2070 




YAIf 


A 245 


52.800 


1.199 


31. 
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1, 


00 


13. 
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N 




ATOM 


2071 


CA 
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A 215 


53,159 


5.131 


32. 


602 


1. 


00 
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19 
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2072 


C 


VAL 


A 245 


52.528 
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611 


1. 


00 
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ATC»r 


2073 


O 
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A 245 


52.706 
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1. 


00 
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CB 
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A 245 
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A 245 
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A 215 
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1, 


90 
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82 
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N 
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A 246 
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CA 
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0 
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.850 
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THR A 247 


49 


.207 


9 


.161 


31 


.963 
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21 
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THR A 247 


18 


.232 


9 
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21 


.39 
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THR A 247 


16 


.868 


8 
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35 


.673 
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21 
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ATOM 
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8 
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ftTCM 2125 CGI ILB A 253 44.fl62 22 .200 26.561 l.OQ PT.-^ri Q 

AlCM 2125 Cg2 ILB A 253 45.473 24.479 27.077 1.00 ^-22 £ 

2122 QVl ILB A 253 45.662 21 .276 27.452 1.00 49.56 Q 

AISJM 2125 £1 ARg A 25^ 42.637 24.709 29.7 07 1.00 19.56 M 

5 AlflM 2129 CA ARG A 254 42.228 25.865 30.522 1.00 19.41 £ 

ATQrt 213i2 £ ARg A 251 ^2.712 25.713 31.970 1.00 18.10 c 

Alflfcl 2131 9 ARg A 251 43.311 26.616 32.51 5 1.00 13.89 Q 

A1S2M 2132 CB ARg A 251 10,791 2$.1Q1 3Q.480 i.oo 15.98 £ 

AlOtl 2125 gg ARg A 251 40.282 27,378 31,255 1.00 9-96 £ 

10 ATOfcl 2124 CP ARg A 25.1 38.809 27.702 31.218 1.00 24,79 £ 

AICM 2135 MB ARg A 251 38,498 28,414 29.997 1.00 29.42 u 

AIS3M 213g eg ARg A 251 38,693 29.723 29,794 1.00 59.85 £ 

AICM 2122 mi ARg A 251 39.194 30.527 30 ,732 1.00 42.58 If 

ATOM 2136 WH2 ARg A 251 38.377 30.245 26.620 1.00 18.44 U 

15 AISM 2139 N ASP A 255 42.406 24.564 32.58 6 1.00 20,22 ft 

Ami 211P CA AgP A 255 42,795 24.205 33.974 1.00 16.48 £ 

AIQfcl 2111 £ AgP A 255 44.321 24.372 34.069 1.00 22.43 £ 

AlCai 2112 Q ASP A 255 44.868 24,897 35.060 1.00 18.53 Q 

AlSai 211? eg ASP A 255 42.479 22.686 34.157 1.00 19.17 £ 

20 AlfiM 2111 eg ASP A 255 42.144 22.246 35.610 1.00 47-08 £ 

AICM 21iL5 9P1 ASP A 255 41.780 23.090 36.429 1.00 49.66 Q 

AlflM 21±6 9P2 ASP A 255 42.020 21.016 35,880 1,00 48.12 2 

AICM 2117 W LEV A 25^ 45.014 23.809 33.078 1.00 15.98 U 

AICM 2115 CA LBV A 256 46,465 23.844 33.069 1.00 21.76 £ 

25 Aiai 2119 C LSV A 25^ 47.020 25.275 33.076 1.00 16.79 £ 

AICM 21Sfi Q LBV A 256 47.825 25.697 33,946 1-00 15.24 Q 

Aim 2151 CB LBV A 256 46.967 23.056 31.859 1.00 23.33 £ 

Aiai 21S2 C2 LEV A 256 48.491 23.100 31.765 l.QD 26.80 £ 

AlCai 2153 CDl LEV A 256 49.171 22 .334 32.964 1.00 17.13 £ 

30 ATOn 2151 CP2 LBV A 256 49,040 22.724 30.346 1.00 15.42 £ 

Ama — 2ii5 — n — ala a 257 46.520 26.043 32.140 1.00 13.77 m 

AlSai 21ii CA ALA A 257 46.93B 27 .436 32.025 1.00 12.70 £ 

AlflM 2157 C ALA A 257 46.656 28 .237 33.267 1.00 10.73 £ 

Aiai 2158 Q ALA A 257 47.451 29 .073 33.672 1.00 20.33 Q 

35 AlCM 215? CB ALA A 257 46.208 28.073 30.834 1-00 13.34 £ 

AigCl 216Q M gLH A 258 45.470 28.080 33.835 1.00 12.40 M 

AlCM 21^1 CA gLM A 259 45.102 28 .911 34.981 1.00 6.39 £ 

AXCM 21£2 C gLN A 258 45.879 28 .480 36.166 1.00 13.48 £ 

Aim 2163 Q QUi A 25.8 46.178 29.281 37.029 1.00 22.96 Q 

40 ATCM 21^1 CP gLK A 259 43.614 28.761 35.305 1.00 16.12 £ 

Ami 21§5 eg gLW A 258 12.671 29.096 31,130 1.00 30.19 £ 

AIQM 2166 CP gLM A 259 42.574 30.585 33.761 1.00 37.29 £ 

hSSSl 21j£2 OEl GLN A 2 58 42.911 31.471 34.610 1.00 21.24 £ 

AlSaa 2168 HE2 gLtf A 259 12,021 3P,976 32.572 l-QQ 15.94 u 

45 Aiai 2169 N THR A 259 46.179 27.182 36.232 1.00 16-21 H 
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Ami 217P Cft TOR A 259 ^^-982 26.678 37.336 1.00 ^fi^n^ £ 

aXCai 2171 C THR A 25g 46.410 27.186 37.23 3 1.00 :?Q,S6 £ 

AlCM 2122 2 THR A 259 49.002 27.621 38.214 1.00 21.44 Q 

AICM 2123 CS THR A 25.g 47.066 25.192 37.3 61 l.QQ 27-56 £ 

5 AICM 2124 Wl TOR A 2^9 45,752 24,620 37.509 1.00 20.92 Q 

AlSaj 2175 Cg2 TOR A 259 47.936 24,796 38.5 45 1.00 12.65 £ 

AICM 217^ ti ILE A 2^P ^g,952 27.170 36 .028 l.OO 19.96 u 

AlfiM 2177 CA ILB A 2$g 50.292 27. 704 35.839 l.QQ 23.01 £ 

Aiai 212S £ ILE A 260 50.313 29,180 36.225 1.00 31.73 C 

10 AlfiM 217? 0 ILE A 260 51.211 29.627 36.993 l.QQ 25.90 Q 

AlfiStl 2180 CB ILJB A 260 50.835 27.456 34.390 1.00 22.46 £ 

AISM 2181 Cgl ILE A 26Q 51.153 25.940 34.232 l.QQ 24.12 £ 

ATSaM 21&2 QQ2 ILB A 260 52.099 28.361 34.1 06 l.QQ 13.47 £ 

AlfiM 2163 CDl TI.R A 2 60 51.501 25.443 32.810 1.00 12.58 £ 

15 aiCM ZIM n ALA A 261 49.280 29.910 35.764 1.00 15.35 H 

AlQfcl Zl^ £A ALft A 261 49.177 31.355 36.048 1.00 16.00 £ 

AI£3M ZIM C AIA A 261 49. 316 31.604 37.550 1.00 20.58 £ 

AlSai 2ia2 Q ALA A,2.61 50.104 32 .443 37.987 1.00 16.09 Q 

AIOM 21M ALA A 261 47.832 31.958 35.487 1.00 13.65 £ 

20 Aim 218? Lyg A 262 48.551 30,843 38.323 1.00 11.50 H 

AlQfcl 2122 £A LYS A 262 48.578 30.905 39.770 l.QQ 10.13 £ 

AISM 2121 C LYS A 262 49.968 30 .460 40.296 1.00 28.08 £ 

AlSaj 2122 Q LYS A 262 50.503 31.084 41.205 1.00 29.37 Q 

AIQM 2123 CB LYS A 262 47.453 30.032 40.335 1.00 12.50 £ 

25 Aiai 2124 — CS — LYg A 262 47.332 29.962 41.888 l.QQ 16.51 £ 

AISM 2125 Zn LY? A 2 6? 46.092 29,092 42,371 1.00 46.61 £ 

AJSM 2126 £E LYS A 2 62 46.344 27.555 42.661 1.00 99.70 £ 

AXffll 2122 — UZ — LYS A 262 15,157 26.703 43 .200 i.oo 36.59 ^ 

Aim 212fi U VAL A 263 50-589 29.443 39.705 1.00 17.44 N 

50 Aim 2122 CA VAL A 26? 51.915 29.039 40.171 1.00 18.72 £ 

Aim 22P9 C VAL A 263 52.997 30.170 39.997 1.00 32.12 £ 

Al<2fcl 2221 Q VA^ A 263 53,971 30.412 40 ,B34 l.OO 21.18 Q 

Aim 2202 CB VAL A 263 52.389 27.709 39.476 1.00 16.35 £ 

Aim 2223 C<?1 VAL A 263 53.920 27,518 39,647 1.00 11.83 £ 

55 Aim 222i Cg2 VAL A 263 51.646 26 ,522 40.093 1.00 14.99 £ 

Aim 2225 n VAL A 264 52.913 30 .899 3B.909 1.00 21.75 ^ 

Aim 2226 CA VAL A 264 53.917 31.877 3B.6 53 1-00 19.81 £ 

Aim 2222 C VAL A 264 53.719 33.208 39.377 1.00 35.79 £ 

Aim 222fi Q VAL A 264 54.632 34.032 39.482 1.00 28.99 q 

^0 Aim 2222 CB VAL A 264 54.059 32.014 37.175 1.00 24.27 £ 

Aim 221ii Cgl VAL A 264 54.728 33.269 36.822 1.00 33.58 £ 

AIQtl 2211 Cg2 VAL A 264 54.840 30.808 36.674 1.00 23.01 £ 

Aim 2212 N GLY A 265 52.550 33.378 39.969 1.00 25.30 ^ 

Aim ZZ12 CA gLY A 265 52^211 34.620 40.636 l.OO 24.14 £ 

^5 Aim 2214 £ GLY A 265 51.73 0 35.694 39.632 1 .00 35.03 £ 
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ATCM 2215 O gLY h 265 51.773 36.911 39.962 1.00 33-71 Q 

AIJJM 221g W TYR A 266 51.294 35.257 38.428 1.00 26.25 M 

ATgW 2212 S^A TYR A 266 ^Pi699 36.151 37 .373 l.OO 26.55 £ 

AICM 221g TYR A 266 49.364 36.745 37 .818 1.00 31.01 £ 

5 AICM 221? <? TYR A 266 4B.532 36.067 38.456 1.00 27.99 Q 

Aaai 2220 CB TYR A 266 50.501 35.463 36.006 1.00 24.31 C 

MCM Z2Z1 eg TYR A 266 49.994 36.381 34.8 94 1.00 28.64 £ 

2222 GDI TYR A 266 50.670 37.582 34.542 1.00 35.05 C 

AlSan 2Z22 CD2 TYR A 2 66 48.860 36. 038 34-118 1.00 22.60 £ 

10 AIS3M ZZ2A Cai TYR A 266 50.212 38.434 33.472 1.00 20.73 £ 

AISM 2225 CB2 TYR A 266 19,429 36.859 33 .012 l.oo 20.91 £ 

AIS3M 222J5 £2 TYR A 266 49,086 38,062 32.735 1.00 23.85 C 

ATOM 2222 QH TYR A 266 49,622 38.851 31.710 l.oo 33.40 q 

ATOM 222fl N LYS A 267 49.217 38 .043 37.604 1.00 25.72 U 

15 2222 £A LYS A 267 47.988 38.697 38.009 1.00 30.77 £ 

AXSM £ LYS A 267 47.217 39.280 36.798 1.00 28. B5 £ 

2221 Q LYP A 267 46.179 39.894 36. 949 1.00 31.17 £ 

AlSai 2222 £2 LYS a 2^7 48.279 39.741 39.092 1.00 27.13 £ 

AICM 2212 £G LYS A 267 48.728 39.128 40. 403 1.00 23.18 £ 

20 Alfiai Z2M £12 LYS A 267 48.420 40.096 41. 562 1.00 30.98 £ 

AI25 2235 CE LYS A 267 47.933 39.358 42.820 1.00 48.52 £ 

AISM 2236 H2 LYS A 267 47.005 38.208 42.505 1.00100. 00 u 

AlSai 2237 N gLY A 269 47t716 39,954 35.594 l.oo 22.67 u 

AICM 2239 £A GLY a 2 68 47.019 39 .518 34.394 1.00 21.38 £ 

25 AICM 2239 £ gLY A 268 45.656 38.568 34,085 1.00 31.03 £ 

Alfili 224fi Q gLY A 268 45.455 37.728 34.911 1.00 19.71 O 

AlOtl 2241 N ARG A 269 45.387 38.645 32.849 1.00 30.40 U 

AICM 2242 CA ARg A 269 44.263 37.846 32.399 1.00 26.47 £ 

Aiai 2243 C ARG A 269 44.680 36.705 31.469 1.00 22.35 £ 

30 2244 P ARG A 269 45.378 36.926 30.524 1.00 22.75 £ 

Aiai 2245 CB ARG A 269 43.297 38.753 31.626 1.00 22.65 £ 

AISM ZZM gg ARg A 269 42.201 39.390 32.463 1.00 24.21 £ 

AICM 2247 CP ARg A 269 40.936 39.465 31.568 1.00 83.45 £ 

AlfiM 221S NB ARg A 269 40.113 40.676 31.762 1.00100.00 u 

35 AlCai 2242 £2 ARg A 269 38.808 40.751 31.431 I.OOIOO.OO £ 

AlSai 225fl NHl ARG A 269 38,201 39,691 30.921 1.00 99.93 U 

ATSEl 2251 WH2 ARg A 269 38.094 41.865 31.663 1.00100.00 U 

AaiCSCl 22S2 U VAL A 270 44,195 35,494 31.759 l-QQ 19.67 n 

AICM 2253 CA VAX. A 270 44.469 34.389 30.656 1.00 24.82 £ 

40 AICM ZZ^ £ VM^ A 270 43.319 34.456 29.824 1.00 22.51 £ 

AX2M 22^5 Q VAL A. 270 42.145 34.501 30.181 1.00 25.79 Q 

Aiai 2255 CB y?u, A 27Q 44.4?6 32,979 31,571 IrPQ 24. Q3 £ 

AIQfcl 2257 <;gl VAL A 270 44.576 31.661 3P,533 1.00 20.72 £ 

ATC^ 2258 CG2 VAL A 270 45.506 32 .849 32.639 1 . 00 11.27 £ 

45 m:2M 22^2 H VAL A Zll 43.660 34.409 29.554 i.QQ 25.18 u 



wo 99/64618 PCT/US99/1 1576 

132 





ATOM 


22^0 


- CA 


VAL A 271 


42.666 




«492 


27 


.497 


1 


.00 


29.32 


c 




AXQK . 


22fil 


C 


VAL A 271 


42.819 




.370 


26 


.442 


1 


.00 


24.99 


C 




ATOM . 


2262 


Q 


VAI, A 271 


43,?23 


33 


.115 


25 


.990 


1 


,00 


21,99 


0 




ATOM 


2263 


CB 


VAi A 271 


42.901 


35 


.913 


26 


.736 


. 1 


,00 


29.25 


c 


5 


ATOM 


22«1 




VAL A 271 


^2.256 


35 


.773 


25 


.370 


1 


.00 


31.91 


c 




ATCM 


226? 




VAI> A 271 


^2,421 


36 


.999 


27 


.565 


1 


,00 


19.72 


C 




ATCM 


2266 


|] 


fH£ A 272 


41.716 


32 


.758 


26 


.019 


1 


.00 


26.14 


N 




ATOM 


2267 


CA 


PHE A 272 


41.752 


31.747 


24.963 


1.00 


24.34 


C 




ATW1 


2269.. 


c 


PHE A 272 


41.236 


-32 


.266 


23 


.623 


1 


.00 


28.95 


c 


10 


ATOM 


226? 


0 


PHE A 272 


40.155 


32 


.926 


23 


.592 


1 


,00 


22.01 


0 




ATCM 


227P 


CB 


PHE A 272 


40,960 


30 


.506 


25 


.391 


1 


.00 


20.97 


c 




ATC« 


2271 


QQ 


PHE A 272 


41.764 


29.570 


26 


.243 


1,00 


21.77 


c 




ATCM 


2272 




PUS A 272 


41.940 


. .29 i 


i942 


27 


.610 


1 


.00 


14.60 


c 




ATC»1 


2273 


CP2 


PHE A 272 


42.504 


29 1 


.550 


25, 


.656 


1 


.00 


22.19 


c 


15 


ATCM 


2274 


CBl 


PHE A 272 


42.763 


29, 


.041 


29, 


.434 


1 


,00 


17.89 


c 




ATCM 


2275 


CB2 


PHE A 272 


43.336 


27, 


,726 


26, 


,454 


1 


,00 


27,64 


c 




ATWl 


2276 . 


CZ 


PHE A 272 


43.479 


27, 


.979 


27, 


.951 


11 


.00 


25.14 


c 




ATCM , 


2277 


N 


ASP A 273 


42.012 


32. 


rli4 


22, 


.542 


1, 


,00 


29-45 


N 




ATCM 


2275 


CA 


ASP A 273 


41.557 


32, 


,536 


21, 


,214 


1 


,00 


22.33 


c 


20 


ATCM 


2279 


C 


ASP A 273 


40.896 


31, 


,365 


20. 


,493 


1. 


,00 


25.67 


C 




ATCM 


2280 


9 


ASP A 273 


41t539 


30, 


,579 


19. 


r793 


11 


.00 


17,91 


P 




ATCM 


2291 




ASP A 273 


42-672 


33. 


,114 


20. 


r343 


1, 


rOO 


21.45 


c 




ATCM 


2292 


TO 


ASP A 273 


42,131 


33. 


626 


18. 


.990 


11 


.00 


26,99 


C 




ATCM 


., 22P3 


C>£>1 


ASP A 273 


40.975 


33, 


249 


18.598 


11 


,00 


27.76 


9 


25 


ATtM 


2284 


PP2 


ASP A 273 


42.8?8 


34, 


421 


19. 


,327 


1, 


,00 


30.06 


0 




ATCM 


2285 


N 


ALA A 274 


39.589 


31, 


284 


20. 


649 


1. 


.00 


15.59 






ATOTf 


2296 


CA 


ALA A 274 


38.932 


30. 


129 


20. 


128 


1.00 


23-75 


c 




ATCM 


2297 




AZiA A 274 


38.853 


30. 


169 


19. 


653 


1. 


.00 


32.30 


c 




ATC»f 


2288 


0 


ALA A 274 


39.294 


29. 


256 


19. 


029 


1. 


,00 


29.37 


0 


30 


AT^M 


?29? 


CB 


ALA A 274 


37.567 


29. 


905 


20. 


777 


1. 


,00 


18.87 


c 




ATOM 


22?o 


N 


SER A 275 


3?, 372 


31, 


243 


19. 


091 


1. 


00 


21,10 


N 




ATCM 


2291 


C^L 


SER A 275 


39.343 


31, 


299 


16. 


631 


1. 


,00 


26.90 


C 




ATOM 


22?2 




SBR A 275 


40.3?0 


30, 


300 


16. 


116 


1. 


00 


43.37 


C 




ATCM 


22?3 




SER A 275 


40.421 


29. 


:949 


14. 


927 


1- 


00 


46.32 


0 


35 


ATOM 


2294 


CB 


SER A 275 


39,547 


32.683 


16.074 


1. 


00 


15.19 


c 




ATOM 


22?5 


OG 


SER A 275 


40.904 


33, 


070 


16. 


079 


1, 


OP 


28.71 


0 




ATCM 


2296 


N 


LYS A 276 


41,192 


29, 


790 


17, 


037 


1. 


00 


22,99 


N 




ATCM 


22?7 


CA 


LYS A 276 


42.178 


29. 


791 


16, 


639 


1. 


00 


23.28 


c 




ATCM 


2298 


C 


LYS A 276 


41.643 


27, 


405 


16. 


976 


1. 


00 


29-73 


C 


40 


ATOM 


2299 


0 


LYS A 276 


40.992 


27. 


206 


19. 


010 


1, 


QP 


25.10 


P 




ATOM 


2300 


CB 


LYS A 276 


43.544 


29. 


051 


17. 


275 


1. 


00 


19.19 


C 




ATOM 


2301 


C2 


LYS A 276 


43.957 


30. 


496 


17. 


219 


1, 


00 


32.11 


c 




ATOM 


2302 


CP 


LYS A 276 


44.062 


30. 


852 


15, 


799 


1. 


00 


22.43 


c 




ATCM . 


2303 


CE 


IiXS A 276 


44.930 


32. 


067 


15, 


57Q 


1, 


OP 


23.18 


C 


45 


ATQrt 


2304 


HZ 


LYS A 276 


45,454 


32, 


117 


14, 


15? 


1, 


00 


29,42 


1^ 



wo 99/64618 PCT/US99/11576 

133 







2305 


. K 


PRO A 277 


11 


,.8?2 


26.176 


16.055 


1 


.00 36.04 


N 




ATCW 


2306 


CA 


PRO A 277 


11 


.116 


25.087 


16.170 


1 


.00 35.93 


c 




ATCM 


2307 


C 


PRO A 277 


12 


.922 


21.332. 


17.363 


1 


.00 29.30 


c 






2308 


0 


PRO A 2 77 


43 


.103 


24.650 


17.985. 


.1 


.09 30.51 


Q 


5 


ATOM 


230? 


Cfi 


PRO A 277 


41 


.?75 


21.153 


11,979 


1 


,00 39.65 


c 






2310 


. X6 


PRO A 277 


43 


.21? 


25.261 


14.566 


1 


.00 12. ?0 


C 




ATCM 


2311 


. CP 


PRO A 277 


12, 


.797 


26.679 


11.8?2_ 


1 


.00 :?7-B4 


C 




ATOM 


2312- 


N 


ASP A 278 


11 


.273 


23.339 


17.809 


1.00 22.35 






ATOM 


2313 


CA 


ASP A 278 


11 


.715 


22.501 


19.?03 


1 


..00 22.16 


c 


10 


ATOM 


231^ 


c 


ASP A 278 


42 


.191 


21.189 


18.272 


1 


.00 19.66 


c 




AT«1 


231$ 


Q 


ASP A 278 


• 11 


.305 


29.?17 


17.117- 


1 


.00 23.49 


0 




ATOM 


231$ 


CB 


ASP A 278 


10 


.635 


22.211 


1?.?71 


1 


.00 15.09 






ATOM 


2317 


C^ 


ASP A 278 


10 


,216 


23,503 


29.702 


1 


.00 22.96 


c 




ATCM 


2319 


ODl 


ASP A 278 


11 


.113 


21.251 


21.096 


1: 


.00 25.18 


o 


15 


ATOM 


231?... 


op2 


ASP A 278 


38 


.??? 


23.787 


29.912 


1 


.00 39.55 


O 




ATOM 


2320 


W 


GLY A 279 


12 


.916 


20.355 


19.044 


1 


.00 30.65 


H 




ATOM 


2321 


CA 


GLY A 279 


43 


.22? 


1?.931 


19.516 


1, 


.90 33.79 


c 




ATC>M 


2322 . 


C 


GLY A 27? 


12 


..11$ 


18.099 


19.?11, 


1, 


.00 38.10 


c 




AXCM 


2323 


0 


GLY A 279 


40 


.963 


18.517 


1?.06B 


1. 


.00 47.52 


o 


20 


ATM 


2324 


a 


THR A 280 


42.419 


16.93? 


19.177 


1, 


.00 29.44 


N 




ATOM 


232$ 


_£A.. 


THR A 280 


41, 


,328 


15.990 


19.587 


li 


.90 26.69 


C 




ATOM 


2326 


C 


THR A 280 


lo- 


.99? 


16.13? 


29.?72 


1 . 


OO 2:^.57 


c 




ATOM 


2327 


o 


THR A 280 


ll, 


,67P 


17.967 


21.713 


1.00 23-62 


o 




ATOM 


2328 


CP 


THR A 280 


41, 


.695 


11i1?2 


1?.510 


1. 


,00 40.78 


c 


25 


ATOM 


232? . 


O&l 


THR A 280 


42. 


889 


14.272 


29.2?6 


1. 


,00 25.56 


0 




ATOM 


2330 


C<?2 


THR A 280 


41.893 


11.951 


19.0?5 


1. 


00 37.71 


C 




AXW 


233J 


N 


PRO A 281 


3?. 


672 


16.063 


21.346 


1.00 25. S4 


N 




AlW .. 


233? ,. 


CA 


PRO A 281 


3?. 


12? 


16.151 


22.629 


1. 


00 25.72 


C 




ATW 


2333 


c 


PRO A 281 


3?. 


776 


15.779 


23.800 


1. 


00 26.02 


c 


30 


ATCW 


2334 


0 


PRO A 281 


3?. 


752 


16.311 


21.?15_ 


1, 


00 22.68 


o 




ATOM . 


2335 


CB 


PRO A 281 


37. 


650 


15.990 
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23.587 


1. 


00 27.88 
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00 27.62 


C 




ATOM 


2311 


0 


ARG A 282 


41. 


?13 


12.192 


23.425 


1. 


00 23.83 
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ATOM 
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ATC»1 
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AICM CPl LBV A 293 53.306 -4 .170 35.836 l.QQ ?8.?5 £ 

CP2 IiEV A 293 52.965 -6. 110 37.439 1.00 47.81 Q 

AISM n gLY A 291 51.316 -2 .510 41.111 l.OO 33.08 M 

AlfiM 2M2 CA cay A 291 51.4B8 -1. 793 42.347 1.00 24.90 £ 

5 AlCM 2M4 C gl,y A 291 52,272 -0 .512 42.326 1.00 29.31 £ 

AISM 2115 Q gLY A 291 53.070 -0.249 43.223 1.00 25.25 Q 

AlSai 2115 M TRP A 29? 52.000 0 ,347 41.368 1.00 27.83 U 

A12M 2117 CA TRP A 29? 52,697 1 .623 41.385 l.QQ 19.45 £ 

AISJM 211fi Q TRP A 295 51,^81 2.731 41.081 l.OO 25.79 £ 

10 AlCM 2112 Q TRP A 29$ 50.765 2.527 40.297 1.00 2n,i^ 3 q 

AISM 215Q CB TRP A 295 -53.961 1.614 40.524 1.00 12. R5 £ 

AISM 2111 eg TRP A 295 54.750 2.911 40.618 l.OO 23.04 £ 

A3K3ai 2152 CPl TRP A 295 55.897 3 .161 41.368 1.00 23.68 £ 

ATOM 2153 CD2 TRP A 295 54.415 4.159 39.979 1.00 20.72 £ 

15 A12M 2151 HBl TR? A 295 56.258 4.493 41.244 1.00 18.67 

him 2155 — C£2 TRP A 295 55,399 5.113 40.373 i.oo 20.95 £ 

AlSaW 215$ CE3 TRP A 295 53.406 4 .550 39.102 1.00 21.47 £ 

AI2M 2152 gZ2 TRP A 295 55.338 6.439 39.958 1.00 17.58 £ 

AIS2M 2456 CZ3 TRP A 295 53.403 5.873 38.632 1.00 21.57 £ 

20 AlSai 2152 — CH2 TRP A 295 51t3$? 6.787 39.058 1.00 19.45 £ 

AlSabl 21^9 , M TYR A 29$ 51,799 3.797 41.884 l.OO 25.17 u 

Ami 2151 CA TYR A 29$ 50.720 4.883 41.7 31 1.00 24.90 £ 

AISM 2152 C TYR A 29$ 51.517 6 .178 41.857 l.OO 30.85 £ 

Aiaa 2153 2 TYR A 29$ 52.363 6 .272 42.745 1.00 21.27 O 

25 AI<2Sa 2151 CB TYR A 29$ 49.654 4 .813 42.840 1.00 25.18 £ 

AISM 2155 eg TYR A 296 48.685 3 .651 42.744 1.00 23.04 £ 

Aiei3 2466 CPl TYR A 29$ 49.078 2.343 43.068 1.00 31.62 £ 

him 21^7 CP2 TYR A 2 9$ 47.380 3.853 42.289 l.QQ 26.02 £ 

AlCM 215fi CSl TYR A 29$ 48,203 1.268 42.935 1.00 24.42 £ 

30 ATCM 21$9 CS2 TYR A 29$ 46.493 2.770 42.127 1.00 24.81 c 

ATS3M 2170 CZ TYR A 29$ 46.902 1 .483 42.464 1.00 39.41 £ 

AlCM 2121 PH TYR A 29$ 45.984 0 .434 42.337 1.00 66.19 Q 

him 2122 U HI? A 297 51.324 7.123 40. 924 1.00 20.95 ^ 

him 2123 CA HIS A 297 52.130 8.343 40. 938 1.00 26.86 £ 

35 AISM 2124 C HIS A 297 51.947 9,175 42.210 1.00 35.01 £ 

him 2475 O HIS A 297 50.885 9.132 42.874 1.00 26.92 Q 

him 217$ Cfi HIS A 297 51.819 9.192 39.733 1.00 25.77 £ 

him 2177 eg HIS A 297 59,199 9.842 39.803 i.oo 31.16 £ 

ATQtJ 2179 WPl HIS A 297 49.314 9.145 39.633 1.00 34.21 ^ 

40 AICM 2179 CP2 HIS A 297 50.135 11.094 40.167 i.oo 25.83 £ 

him 215i2 CBl HIS A 297 48.290 9.972 39.776 1.00 24.14 £ 

him 2151 HB2 HIS A 19,7$1 11.164 40.087 1.00 23.35 N 

AlCai 2152 n gLV A 29g 52.963 9.926 42.554 1.00 24.98 ^ 

him 2193 CA gLV A 299 52.957 10.683 43.798 1.00 27.65 £ 

45 iiim 2154 C gLV A 299 52.831 12 .187 43.741 l.QQ 36.86 £ 
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AlQfcl 21B5 Q gLV A 2gg 52.433 12. 792 44.718 1.00 43.61 Q 

AlCM 2196 CB gLV A 2?9 54.153 10.319 44-68fi l.QO 2:>.Q2 c 

AlflM 2197 TO gLV A ZSg 54.004 e.943 45.285 1.00 36.42 Q 

AliJM Zlflfi CP gLV A 2gg 54-999 8 .664 46.406 1. 00100.00 £ 

5 AIS2M 21g? QEX gliV A 2?? ^^^222 9.561 46.1 52 l.OO 44.79 Q 

Alfiaa 2£Sfi 0E2 gLV A 299 54.526 8.470 47.547 1.00100.00 Q 

AICM 21?I N ILE A 2?? 53-232 12.800 42-639 1.00 23.49 H 

AIS3M 2192 CA ILE A 29? 53.268 14,244 42.56 2 1.00 13.25 Q 

AISM 2123 Q ILE A 299 52.016 14.848 41.906 1.00 27.05 Q 

10 AlSai 2121 Q ILE A 299 51,681 14.530 40.757 1.00 26,73 Q 

AISM 2125 Cfi ILE A 299 54.586 14.711 41.862 1.00 15,93 c 

A3S2M 219^ Cgl ILE A 299 55.836 14.183 42.606 l.QO 23,83 £ 

AXQSfcl 2127 — Cg2 ILE A 299 51,596 16.213 41 .541 i.QO 17,37 q 

AISM 2196 CDl ILE A 299 57.232 14.221 41.787 l.OO 21,32 Q 

15 AlCM 2199 N ?BR A 300 51.323 15.716 42.648 1.00 18.55 

AICM 25QQ CA SBR A 3PJ2 50.177 16.449 42.091 1.00 19,5ff 5^ 

AI£3M 2SfiLl C SBR A 309 50.714 17.415 41 ,0 42 l.QO 17.29 

AIQM 2£fl2 Q SBR A 3PQ 51.824 17.941 41.1 78 l.OO 21-06 Q 

AIQM 2ifi3 CP SER A 3P9 49.542 17.307 43-1 81 1.00 16.78 £ 

20 A1J2M 25P1 OS SER A 3PP 50.548 17.969 43.923 1.00 75.80 q 

AlSaai 2505 U LEV A 301 49.870 17.755 40-075 1.00 16.13 y 

AIQM 250^ £A leu a 301 50.246 18,675 39-014 l.QO 17.70 q 

AISSM 2£fi2 — £ LBV A 301 50. 689 19,964 39.646 1,00 20.11 q 

AISM 2521 O LEV A 301 51.714 20,568 39.303 l.QO 20.46 Q 

25 ATSai 2509 gfl LEV A 30a 48,990 18 .981 38.197 1.00 17.92 q 

AlSai 251P eg LEV A 301 49.182 20 ,030 37-112 1-OQ 25.15 q 

AlStl ZSlH CPl LBV A 3P1 5Pt233 19-552 36,086 l.OO 18.82 c 

AIGM 2513 CD2 LBV A 301 47.854 20,177 36-436 1.00 25.89 q 

AICM 2513 n gLV A 302 49,845 20.399 40.554 1 . QQ 27.01 n 

30 AlSai 2^11 gA gLV A 302 50.053 21 ,636 41.280 l.QO 37.72 c 

AI£3M 2S15 £ gLV A 302 51.410 21.618 41.996 l.QO 29.99 _q 

AlCai 2S16 Q gLV A 302 52.245 22-514 41.798 1.00 27.15 Q 

AIS2I 2517 Cfi gW A 30? 48.899 21-841 42.275 1.00 43.10 q 

AI21 2519 ££ GLU a 302 49,061 23,061 43.174 1.00 90.85 q 

35 Aiai 2^12 on gLV A 302 49,451 24.324 42.580 1.00100. 00 £ 

AlSai 2525 QEl gLV A 302 47.566 24.209 41.706 1. 00100-00 q 

2S21 QS2 gLV A 302 1B,g09 25.432 43.036 l.OO 64.50 Q 

AXflM 2522 n ALA A 303 51.646 20 .591 42.801 l.QO 8-72 M 

AlfiM 2523 CA ALA A 303 52.937 20 .455 43.459 1.00 15.03 Q 

^0 2521 C ALA A 303 54.102 20 .355 42.450 1.00 19.85 £ 

AICM 2525 Q ALA A 303 55.104 21 , 090 42.553 1.00 22,24 Q 

AlCM 252J5 £fi ALA A 3Q3 52.938 1 9.258 44.410 1.00 18.97 2 

AlCM 2527 M gLY A 301 53.953 19.472 41.467 1.00 13.05 ^ 

Aiai 2528 t;^ (^LY A 301 54.970 19 ,321 40.448 1.00 8.94 q 

A3U2i 2529 g g L Y A 30^ 55.239 20.621 39.695 1,00 20.31 c 
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253Q Q GLY A 301 56.3?^ 20.900 39.322 i.no i4.:^o q 

hJSSl U LBV A 3Q? 54,193 21,383 39.361 i. oo lo.ie h 

ATCM gft LBV A 30? 54.483 22.622 38.61 1 1.00 20.29 £ 

ai£3M 2i23 C LBV A 305 55,291 23.669 39.456 l.OO 28.92 £ 

5 ATC»4 2534 O LEU A 305 56.194 24.385 38.974 1.00 17.69 Q 

AISM 2i35 Cfi LBV A 305 53.202 23.245 38.0 33 1.00 24.03 £ 

AIS3M 2ilfi CSz LBV A 30.5 52.357 22.647 3 6.880 1.00 27.66 £ 

AlSaaa 2537 CTI LBV a 305 50.975 23.384 36.789 1.00 13.44 £ 

A1S3M 251fl Cg2 LBV A 305 53.079 22.724 35.5 43 1.00 18-39 £ 

10 ATOM 2ijl2 H AIA a 306 54.904 23.757 40.7 24 1.00 19.94 

AlflM 2510 CA ALA A 306 55.544 24.660 41. 655 1.00 24.79 £ 

AICM 2^40 C ALA A 3Qg 57.035 24.380 41.7 43 1.00 27,51 £ 

ATOM Q ALA A 30^ 57.852 25.280 41.662 1.00 29.68 _Q 

ai£3M ZM2 CB ALA A 306 54.937 24.471 43.002 1.00 17.87 £ 

15 AIS3M 2i44 U gBR A 307 57.378 23.137 42.0 11 1.00 1fl.4fi u 

ATCW 2515 CA SER A 307 58.793 22.756 42.162 1.00 16.31 £ 

ATCTl 2516 £ SER A 307 59.547 22 .885 40.832 1.00 22.66 £ 

A1S21 2517 Q SER A 307 60.742 23.212 40.786 1.00 28.47 Q 

AISM 2519 £fi SER A 307 58.851 21.304 42.622 1.00 20.47 £ 

20 AIS2M 251? 02 SER A 307 58.517 20.454 41.526 1.00 29.03 Q 

hXm 2550 U THR A 308 58.849 22.631 39.735 1.00 27.31 H 

AXSaM 255; CA THR A 308 59.458 22 .738 38.413 1.00 22.89 £ 

aiSM 2552 C IHR A 309 59.757 24.216 38,107 1.00 26.06 £ 

AISM 2553 Q THR A 309 60.819 24.546 37.5 91 1.00 29.89 Q 

25 AISaM 2^ gg THR A 309 58.536 22.115 37.3 18 1.00 18.72 £ 

AIS3M 2555 Pgl THR A 309 58.356 20.714 37-545 1.00 20.17 O 

ATOM 2556 CG2 THR a 308 59.094 22.330 35.923 1.00 12.37 £ 

AIGM 2557 H TYR A 30? 59,91$ 25,119 38.453 i . 00 28.20 u 

AISM 2559 CA TYR A 309 59.110 26 .549 38.241 1.00 31.09 £ 

30 AlfiM 25i5 C TYR A 309 60.363 27.059 39.0 45 1.00 16.31 £ 

AXQM 2560 Q TYR A 309 61.179 27.858 38.577 1.00 16.91 o 

AlCai 2i£l CB TYR A 3Q9 57.819 27.373 38.533 1.00 31.19 £ 

ATOM 2562 CS TYR A 309 57.944 28.895 38.392 1.00 14.57 C 

ATSai 2563 CPl TYR A 309 58.397 29 .457 37.224 1.00 17.51 £ 

35 aiQM 2561 CP2 TYR A 309 57.575 29.757 39.442 1.00 24.99 £ 

AlSai 2565 CEl TYR A 309 58.527 30.801 37.100 1.00 18.41 £ 

ATOM 2566 CB2 TYR A 309 57.744 31.129 39.351 1.00 19.04 £ 

ATOM Zh32 £2 TYR A 309 59,212 31.641 38.164 i.oo 29.13 £ 

ATOM 25 69 O H TY R A 309 58.300 33.004 37.966 1.00 28.22 Q 

40 Alfitl 2569 N GI^ A 310 60.560 26.579 40.260 1 . 00 15.41 H 

AIS2M 2570 CA GLN A 310 61.7Q5 26.964 41.087 1.00 ?2.35 £ 

ATCT1 2571 £ gLN A 310 63.001 26.492 40.446 1.00 :^1.46 £ 

ATOM 2572 O GXN A 310 64.009 27.191 40.442 1.00 33.42 O 

hTOn 2573 £E gLN A 310 61.587 26.335 ai,j[^2 1.00 17.67 C 

45 ATOH 25 74 CG GLN A 310 62.579 26 .921 43.461 1 . QQ .S7.5R £ 
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While various embodimmts of the present invention have been described in detail, 
it is apparent that modifications and ada pt a t ions of those embodiments will occur to those 
stalled in the art. However, it is to be expressly understood that such modifications and 
adaptations are within the spirit and scope of the present invention, as set forth in the 
5 following claims. 
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What is claimed : 

1 . A method for produdng ascorbic add or esters thereof in a microorganism, 
comprising culturing a microoiiganism having a genetic modification to increase the action 
of an enzyme selected from the group consisting of hexokinase, glucose phosphate 

5 isomerase, phosphomannose isomerase, phosphomannomutase, GDP-I>-mannose 
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-y-l^one dehydrogenase; and recovering said ascorbic acid or esters thereof 

2. A method, as claimed in Claim 1, wherein said genetic modification is a 
10 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-D-maimose: GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1-P-phosphatase, L-galactose dehydrogeimse, and L- 
galactono-y -lactone dehydrogenase. 

3. A method, as claimed in Claim 1, wherein said genetic modification is a 
15 genetic modification to increase the action of an epimerase that catalyzes conversion of 

GDP-D-mannose to GDP-L-galactose. 

4. A method, as claimed in Claim 3, wherein said genetic modification is a 
genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose 
epimerase. 

20 5. The method of Claim 3, wherein said genetic modification comprises 

transformation of said microorganism with a recombinant nucleic acid molecule that 
expresses said epimerase. 

6. The method of Claim 5, wherein said epimerase has a tertiary structure that 
substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 

25 epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 
Bank Accession Code Ibws. 

7. The method of Claim 5, wherein said epimerase has a structure having an 
average root mean square deviation of less than about 2.5 A over at least about 25% of 
Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-marmose epimerase/ 

30 reductase represented by atomic coordinates having Brookhaven Protein Data Bank 
Accession Code Ibws. 
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8. The method of Claim 5, wherein said epimerase has a tertiary structure 
having an average root mean square deviation of less than about 1 A over at least about 
25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data 

5 Bank Accession Code Ibws. 

9. The method of Claim 5, wherein said epimerase comprises a substrate 
binding site having a tertiary structure that substantially conforms to the tertiary structure 
of the substrate binding site of a GDP-4-kcto-6-deoxy-D-mannose epimerase/reductase 
represented by atomic coordinates having Brookhavai Protein Data Bank Accession Code 

10 Ibws. 

10. The method of Claim 9, wherein said substrate binding site has a tertiary 
structure with an average root mean square deviation of less than about 2.5 A over at least 
about 25% of Ca positions of the tertiary structure of a substrate binding site of a GDP-4- 
keto-6-deoxy-D-mannose ^imerase/reductase represented by atomic coordinates having 

1 5 Brookhaven Protein Data Bank Accession Code Ibws. 

1 1 . The method of Claim 5, wherein said epimerase comprises a catalytic site 
having a tertiary structure that substantially conforms to the tertiary structure of the 
catalytic site of a GDP-4-keto-6-deoxy-D-maruK)se epimerase/reductase represented by 
atomic coordinates having Brookhaven Protein Data Bank Accession Code Ibws. 

20 12. The method of Claim 1 1, v^erein said catalytic site has a tertiary structure 

with an average root mean square deviation of less than about 2.5 A over at least about 
25% of Ca po^ons of the tertiary structure of a catalytic site of a GDP-4-keto-6-deoxy- 
D-marmose epimerase/reductase represented by atomic coordinates having Brookhaven 
Protein Data Bank Accession Code Ibws. 

25 13. The method of Claim 1 1, wherein said catalytic site comprises the amino 

acid residues serine, tyrosine and lysine. 

14. The method of Claim 13, wherein tertiary structure positions of said amino 
acid residues serine, tyrosine and lysine substantially conform to tertiary structure 
positions of residues SerlOT, Tyrl36 and Lysl40, respectively, as represented by atomic 

30 coordinates in Brookhaven Protein Data Bank Accession Code Ibws. 

15. The method of Claim 5, wherein said epimerase binds NADPH. 
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16. The method of Claim 5, wherein said q)imerase comprises an amino add 
sequence that aligns with SEQ ID NO: 1 1 using a CLUST AL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 50% of non-Xaa residues in SEQ ID NO:l 1. 
5 17. The method of Claim 5, wherein said epimerase comprises an amino acid 

sequence that aligns with SEQ ID NO; 1 1 uang a CLUSTAL alignment program, wherein 
amino acid residues in said amino acid sequence align with 100% identity with at least 
about 75% of non-Xaa residues in SEQ ID NO: 1 1 . 

18. The method of Claim 5, wherein said epimerase comprises an amino acid 
1 0 sequence that aligns with SEQ ID NO: 1 1 using a CLUSTAL alignment program, wherein 

amino acid residues in said amino acid sequence align with 100% identity with at least 
about 90% of non-Xaa residues in SEQ ID NO:l 1. 

19. The method of Claim 5, wherein said epimerase comprises an amino add 
sequence having at least 4 contiguous amino add residues that are 100% identical to at 

1 5 least 4 contiguous amino add residues of an amino add sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID 
NO: 10. 

20. The method of Claim 5, wherein said recombinant nucldc acid molecule 
comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides 

20 having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9. 

21. The method of Claim 5, wherdn said epimerase comprises an amino acid 
sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gty. 

25 22. The method of Claim 5, wherdn said recombinant nucleic acid molecule 

comprises a nucleic add sequence that is at least about 15% identical to a nucldc add 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard defoult parameters. 

30 23. The method of Claim 5, wherein said recombinant nucleic acid molecule 

comprises a nucldc add sequence that is at least about 20% identical to a nucleic acid 
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sequence selected from the group consisting of SEQ ED NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard default parameters. 

24. The method of Claim 5, wherdn said recombinant nucleic acid molecule 
5 comprises a nucleic acid sequence that is at least about 25% identical to a nucleic acid 

sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method 
with Lipman-Pearson standard de&uh parameters. 

25. The method of Claim 5, wherein said recombinant nucleic acid molecule 
10 comprises a nucleic add sequence that hybridizes under stringent hybridization conditions 

to a nucleic add sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/ 
reductase. 

26. The method of Qaim 25, wherein said nuddc add sequence ^coding said 
GDP-4-keto-6-deoxy-D-nwnnose epimerase/reductase is selected from the group 

15 consisting of SEQ ID NO: 1, SEQ ID NO:3 and SEQ ID NO:5. 

27. The method of Claim 25, A^erein said GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase comprises an amino add sequence sdected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, 

28. A method, as claimed in Claim 1, wherein said microorganism is selected 
20 from the group consisting of bacteria, fiingi and microalgae. 

29. A method, as claimed in Claim 1, wherein said microorganism is add- 
tolerant. 

30. A method, as claimed in Claim 1, wherein said microorganism is a 
bacterium. 

25 3 1 . A method, as claimed in Claim 30, wherein said bacterium is selected from 

the group consisting oi Azotobacter and Pseudomonas. 

32. A method, as claimed in Claim 1, wherein said microorganism is a fungus, 

33 . A method, as claimed in Claim 32, wherein said microorganism is a yeast. 

34. A method, as claimed in Claim 33, wherein said yeast is selected from the 
30 group consisting of Saccharomyces yeast. 
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35. A method, as claimed in Claim 1, wherein said microorganism is a 
microalga. 

36, A method, as claimed in Claim 35, wheran said microalga is selected from 
the group consisting of microalgae of the genera Prototheca and Chlorella. 

5 37. A method, as daimed in Claim 36, wherein said microalga is selected from 

the genus Prototheca. 

38. A method, as claimed in Claim 1, wherein said microorganism fiirther 
comprises a genetic modification to decrease the action of an enzyme having GDP-D- 
mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase. 
10 39. A method, as claimed in Claim 38, wherein said genetic modification to 

decrease the action of an enzyme having GDP-I>-mannose as a substrate, other than GDP- 
D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP-D-mannose-dehydrogenase. 

40. A method, as claimed in Claim 1, wherein said microorganism is acid- 
15 tolerant and said step of culturing is conducted at a pH of less than about 6.0. 

41. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 5.5. 

42. A method, as claimed in Claim 1, wherein said microorganism is acid- 
tolerant and said step of culturing is conducted at a pH of less than about 5.0. 

20 43. A method, as claimed in Claim 1, wherein said step of culturing is 

conducted in a fermentation medium that is magnesium (Mg) limited. 

44. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that is Mg limited during a cell growth phase. 

45. A method, as claimed in Claim 1, wherein said step of culturing is 
25 conducted in a fermentation medium that comprises less than about 0.5 g/L of Mg during 

a cell growth phase. 

46. A method, as claimed in Claim 1, wherein said step of culturing is 
conducted in a fermentation medium that comprises less than about 0.2 g/L of Mg during 
a cell growth phase. 
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47. A method, as claimed in Claim 1, wherein said step of cuhuring is 
conducted in a fennentadon medium that comprises less than about 0. 1 g/L of Mg during 
a cell growth phase. 

48. A method, as claimed in Claim 1, wherein said step of culturing is 
S conducted in a fermentation medium that comprises a carbon source other than D- 

mannose. 

49. A method, as claimed in Claim 1, whonetn said step of culturing is 
conducted in a fermentation medium that comprises glucose as a carbon source. 

50. A microorganism for producing ascorbic acid or esters thereof wherein 
10 said microorganism has a genetic modification to increase the action of an enzyme 

selected fi-om the group consisting of hexokinase, glucose phosphate isomerase, 
phosphomannose isomerase, phosphomannoniutase, GDP-I>-mannose pyrophosphorylase, 
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L- 
galactose-l-P-phosphatase, L-galactose dehydrogenase, and L-galactono-y-lactone 
1 5 dehydrogenase. 

51. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of an enzyme selected fi-om 
the group consisting of GDP-D-maimose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose- 1-P-phosphatase, l-galactose dehydrogenase, and L- 

20 galactono-y-lactone dehydrogenase. 

52. A microorganism, as claimed in Claim 50, wherein said genetic 
modification is a genetic modification to increase the action of GDP-D-mannoseiGDP-L- 
galactose epimerase. 

53 . A microorganism, as claimed in Claim 50, v^ierein said microorganism has 
25 been genetically modified to express a recombinant nucleic acid molecule encoding an 

epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein 
said epimerase has a tertiary structure having an average root mean square deviation of 
less than about 2.5 A over at least about 25% of Ca positions of the tertiary structure of 
a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic 
30 coordinates having Brookhaven Protein Data Bank Accession Code Ibws. 
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54. A microorganism, as claimed in Claim 50, wherein said microorganism is 
selected from the group consisting of bacteria, fungi and microalgae. 

55. A microorganism, as claimed in Claim 50, wherein said microorganism is 
a bacterium. 

5 56. A microorganism, as claimed in Claim 55, wherein said bacteriimi is 

selected from the group consisting of Azotobacter and Pseudomonas. 

SI. A microorganism, as claimed in Claim 50, wherein said microorganism is 
a fungus. 

58. A microorganism, as claimed in Claim 57, wherein said microorganism is 

10 a yeast. 

59. A microorganism, as claimed in Claim 58, wherein said yeast is selected 
from the group consisting of Saccharomyces yeast. 

60. A plant for producing ascorbic acid or esters thereoj^ wherein said plant 
has a genetic modification to increcise the action of an enzyme selected from the group 

15 consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, 
phosphomannomutase, GDP-D-mannose pyrophosphorylase, CH)P-D-mannose:GDP-L- 
galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-l-P-phosphatase, L- 
galactose dehydrogenase, and L-galactono-Y-lactone dehydrogenase. 

61. A plant, as claimed in Claim 60, wherein said genetic modification is a 
20 genetic modification to increase the action of an enzyme selected from the group 

consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose 
phosphorylase, L-galactose-l-P-phosphatase, L-galactose dehydrogenase, and L- 
galactono-Y-lactone dehydrogenase. 

62. A plant, as clEiimed in Claim 60, wherein said genetic modification is a 
25 genetic modification to increase the action of GDP-D-mannose:GDP-L-gaIactose 

epimerase. 

63. A plant, as claimed in Claim 60, wherein said plant has been genetically 
modified to express a recombinant nucleic acid molecule encoding an epimerase that 
catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase 

30 has a tertiary structure having an average root mean square deviation of less than about 
2.5 A over at least about 25% of Ca positions of the tertiary structure of a GDP-4-keto-6- 
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deoxy-D-mannose epimerase/reductase represented by atomic coordinates having 
Brookhaven Protein Data Bank Accession Code Ibws. 

64. A plant, as claimed in Claim 60, wherein said plant further comprises a 
genetic modification to decrease the action of an enzyme having GDP-D-mannose as a 

5 substrate other than GDP-D-mannose:GDP-L-gaIactose epimerase. 

65. A plant, as claimed in Claim 60, wherein said genetic modification to 
decrease the action of an en^^mie having GDP-D-mannose as a substrate other than GDP- 
D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action 
of GDP-D-mannose-dehydrogenase. 

10 66. A plant, as claimed in Claim 60, wherein said plant is a microalga. 

67. A plant, as claimed in Claim 66, wherein said plant is selected fi-om the 
group consisting of microalgae of the genera Proiotheca and Chlorella. 

68. A plant, as claimed in Claim 66, herein said microalga is selected fi'om 
the genus Proiotheca. 

15 69. A plant, as claimed in Claim 60, wherein said plant is a higher plant. 

70. A plant, as claimed in Claim 60, i?vdierein said plant is a consumable higher 

plant. 

71 . A microorganism for producing ascorbic acid or esters thereof, wherein 
said microorganism has been genetically modified to express a recombinant nucleic acid 

20 molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L- 
galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ 
ID NO: 11 using a CLUSTAL alignment program, wherein amino acid residues in said 
amino acid sequence align with 100% identity with at least about 50% of non-Xaa 
residues in SEQ ID NO: 1 1 . 

25 72. A plant for producing ascorbic acid or esters thereof, wherein said plant 

has been genetically modified to express a recombinant nucleic acid molecule encoding 
an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein 
said epimerase comprises an amino acid sequence that aligns with SEQ ID NO: 1 1 using 
a CLUSTAL edignment program, wherein amino add residues in said amino acid sequence 

30 align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO: 1 1 . 



wo 99/64618 



1/12 



PCTAJS99/11576 




wo 99/64618 



PCT/US99/n576 

2/12 




wo 99/64618 



3/12 



PCT/US99/11576 




wo 99/64618 



4/12 



PCTAJS99/I1576 




wo 99/64618 



5/12 



PCT/US99/11576 



CD 
W 
O 



o 




wo 99/64618 



6/12 



PCT/US99/n576 




wo 99/6461 8 PCT/US99/1 1576 

7/12 




"l/6uj 'ppv oiqjoosv 



wo 99/64618 



PCTAJS99/11576 



8/12 




029V sjnjinon 
jsd VV 6uJ 'uoiiBLUJoj VV aJ^psds aBejaAV 



wo 99/64618 



9/12 



PCTAJS99/11576 



039V 8Jn}|non 
J9d w 6lu 'uohblujoj vV oypadg 




O CO CD Tl- CNI O 

o o o o o 



wo 99/64618 PCT/US99/n 576 

10/12 




wo 99/64^1« 



11/12 



PCTA;S99/11576 




-o o 

iO CO 




I 

Q 
CD 



wo 99/64618 



12/12 



PCT/US99/11576 




wo 99/64618 



SEQUENCE LISTING 



PCTAJS99/11576 



<110> Berry, Alan 

Running, .Jeffrey A, 
Severson, David K. 
Burlingame, Richard P . 

<120> "VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS" 

<130> 3161-24-PCT 

<140> not yet assigned 
<141> 1999-05-25 

<150> 60/125,073 
<151> 1999-03-17 

<150> 60/125,054 
<151> 1999-03-18 

<150> 60/088,549 
<151> 1998-06-08 

<160> 15 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 1583 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (49) . . (993) 

<400> 1 

tagtctttaa tttcgcagcg tttttataat tgtgcagagg tttcgtcc atg tct gac 57 

Met Ser Asp 
1 

aaa tct gcc aaa ate ttc gtc gcg ggt cat cgt ggt ttg gtt gga tct 105 
Lys Ser Ala Lys He Phe Val Ala Gly His Arg Gly Leu Val Gly Ser 
5 10 15 



gcc att gtc cgc aag ctt cag gaa caa ggt ttc acc aat etc gtt ctt 
Ala He Val Arg Lys Leu Gin Glu Gin Gly Phe Thr Asn Leu Val Leu 
20 25 30 35 



153 
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aaa aca cac 

Lys Thr His 



ttc ttt tct 
Phe Phe Ser 



gtt ggt ggt 
Val Gly Gly 
70 

gtc aat etc 
Val Asn Leu 
86 

ggt gtg aag 
Gly Val Lys 
100 

ttt get cct 
Phe Ala Pro 



gaa cca act 
Glu Pro Thr 



act tgt cag 
Thr Cys Gin 
150 

atg cct act 
Met Pro Thr 
165 

cat gtg ctt 

His Val Leu 
180 

tgg age gga 

Trp Ser Gly 



ggg aag ttc 
Gly Lys Phe 



gcc gag ctt 
Ala Glu Leu 
40 

caa gag aag 
Gin Glu Lys 
55 

att cac get 
lie His Ala 



cag att cag 
Gin lie Gin 



aag ctt etc 
Lys Leu Leu 
105 

cag cca att 
Gin Pro lie 
120 

aat gag tgg 
Asn Glu Trp 
135 

get tat agg 
Ala Tyr Arg 

aat etc tat 
Asn Leu Tyr 



cct get ctt 
Pro Ala Leu 
185 

gga agt tgt 
Gly Ser Cys 
200 

ttg cat gtt 
Leu His Val 
215 



gat etc act 
Asp Leu Thr 



cca gtt tat 
Pro Val Tyr 
60 

aac aac ace 
Asn Asn Thr 
75 

acc aat gtg 
Thr Asn Val 
90 

ttc ctt gga 
Phe Leu Gly 



cct gag tct 
Pro Glu Ser 



tat get att 
Tyr Ala lie 
140 

att cag cac 
lie Gin His 
155 

ggt cct aat 
Gly Pro Asn 
170 

atg agg agg 
Met Arg Arg 



ggt gtg ggg 
Gly Val Gly 



gat gat ttg 
Asp Asp Leu 
22X) 



cgt caa gcc 
Arg Gin Ala 
45 

gta ate eta 
Val lie Leu 



tat cct get 
Tyr Pro Ala 



ate cac tct 
lie His Ser 
95 

tea tec tgc 
Ser Ser Cys 
110 

get ttg tta 
Ala Leu Leu 
125 

get aag ate 
Ala Lys lie 



gga tgg gat 
Gly Txrp Asp 

gac aat ttc 
Asp Asn Phe 
175 

ttc cac gag 
Phe His Glu 
190 

tac aag gta 
Tyr Lys Val 
205 

get gat get 
Ala Asp PiXa 



gat gtt gaa 
Asp Val Glu 
50 

gca gca get 
Ala Ala Ala 
65 

gat ttc att 
Asp Phe lie 

eo 

gca tat gag 
Ala Tyr Glu 



att tac cct 
lie Tyr Pro 



aca gca teg 
Thr Ala Ser 
130 

get ggg att 
Ala Gly lie 
145 

gca ate tct 
^ULa lie Ser 
160 

cac ccg gag 
His Pro Glu 



gcg aaa gtg 
Ala Lys Val 



gtc ccg ttg 
Val Pro Leu 
210 

tgt gtt ttc 
Cys Val Phe 
225 
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tec 201 
Ser 



aaa 249 
Lys 



ggt 297 
Gly 



cac 345 
His 



aaa 393 

Lys 

115 

ctt 441 
Leu 

aag 489 
Lys 



ggc 537 
Gly 

tct 585 
Ser 



aat 633 

Asn 

195 

gaa 681 
Glu 



ttg 729 
Leu 



2 
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ctg gat egg ata cag egg ggg ttg gag cat gtt aac att gga agt ggt 777 
Leu Asp Arg lie Gin Arg Gly Leu Glu His Val Asn lie Gly Ser Gly 
230 235 240 

caa gaa gtg act att aga gag ttg get gag ttg gtg aaa gag gtt gtt 825 
Gin Glu Val Thr lie Arg Glu Leu Ala Glu Leu Val Lys Glu Val Val 
245 250 255 

ggt ttt gaa ggg aag ctt gga tgg gat tgc act aag cca gat ggc aca 873 
Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro Asp Gly Thr 
260 265 270 275 

ccg agg aaa ctt atg gac age tea aag etc gcg tct ttg ggt tgg aca 921 
Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu Gly Trp Thr 
280 285 290 

cct aag gtt tct ctt aga gat ggt ctg age caa act tat gat tgg tat 969 
Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gin Thr Tyr Asp Trp Tyr 
295 300 305 

ttg aag aat gtt tgc aac ega taa gttaatggtt tctcttctca tatatacaca 1023 
Leu Lys TVsn Val Cys Asn Arg 

310 315 

actattgagt ctcaggtaaa tcagcttate accacattgt gatttaaacc tttctttgag 1083 

attcgagaat tgcttttttt tttatcaaaa ttgattcatt tagagataag acttgcttct 1143 

ttatacaaea ttgtctgagg aattttaatt ttggatctcc gagtatggtc tattattagc 1203 

tctcttctat acaaattatc aaaacagttg taagaagttt caagaaaaac atttgatatc 1263 

tcactaattt ggctatcctt gcaagttgca acgctaaaat gacaaataat gaattctcgg 1323 

cccaatgggc ttacacaagc cttgttaaag atagcgtgaa caaaacgcgg ctcactagcc 1383 

ctaacctgtc tetctttcgc ttaccttctt ettcgtettc gttggctcag tcacttgact 1443 

tcacggcccg etcaagctet gacacgaaac tcatttcaaa ttaatttaat aaaaccttaa 1503 

tcacaaaagg ggcaaaagca atcgccggcg attatgeett ctcctccggt gccggagaeg 1563 

gttgtgagcc aacccgttcg 1583 



<210> 2 
<211> 314 



3 
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<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ser Asp Lys Ser Ala Lys lie Phe Val Ala Gly His Arg Gly Leu 
15 10 15 

Val Gly Ser Ala lie Val Arg Lys Leu Gin Glu Gin Gly Phe Thr Asn 
20 25 30 

Leu Val Leu Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gin ;\JLa Asp 
35 40 45 

Val Glu Ser Phe Phe Ser Gin Glu Lys Pro Val Tyr Val lie Leu Ala 
50 55 60 

Ala Ala Lys Val Gly Gly lie His Ala Asn Asn Thr Tyr Pro Ala Asp 
65 70 75 80 

Phe lie Gly Val Asn Leu Gin lie Gin Thr Asn Val lie His Ser Ala 
85 90 95 

Tyr Glu His Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys lie 
100 105 110 

Tyr Pro Lys Phe Ala Pro Gin Pro lie Pro Glu Ser Ala Leu Leu Thr 
115 120 125 

Ala Ser Leu Glu Pro Thr Asn Glu Trp Tyr Ala lie Ala Lys lie Ala 
130 135 140 

Gly lie Lys Thr Cys Gin Ala Tyr Arg lie Gin His Gly Trp Asp Ala 
145 150 155 160 

lie Ser Gly Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His 
165 170 175 

Pro Glu Ser His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala 
180 185 190 

Lys Val Asn Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val 
195 200 205 

Pro Leu Glu Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys 
210 215 220 

Val Phe Leu Leu Asp Arg lie Gin Arg Gly Leu Glu His Val Asn He 
225 230 235 240 



4 
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Giy Ser Gly Gin Glu Val Thr lie 
245 

Glu Val Val Gly Phe Glu Gly Lys 
260 

Asp Gly Thr Pro Arg Lys Leu Met 

275 280 

Gly Trp Thr Pro Lys Val Ser Leu 
290 295 

Asp Trp Tyr Leu Lys Asn Val Cys 
305 310 



Arg Glu Leu Ala Glu Leu Val Lys 
250 255 

Leu Gly Trp Asp Cys Thr Lys Pro 
265 270 

Asp Ser Ser Lys Leu Ala Ser Leu 
285 

Arg Asp Gly Leu Ser Gin Thr Tyr 
300 

Asn Arg 



<210> 3 
<211> 966 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (966) 

<400> 3 

atg agt aaa caa cga gtt ttt att get ggt cat cgc ggg atg gtc ggt 48 

Met Ser Lys Gin Arg Val Phe lie Ala Gly His Arg Gly Met Val Gly 

15 10 15 

tec gcc ate agg egg cag etc gaa cag cgc ggt gat gtg gaa ctg gta 96 
Ser Ala lie Arg Axg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 
20 25 30 

tta cgc acc cgc gac gag ctg aac ctg ctg gac age cgc gcc gtg cat 144 
Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Axg Ala Val His 
35 40 45 

gat ttc ttt gcc age gaa cgt att gac cag gtc tat ctg gcg gcg gcg 192 
Asp Phe Phe Ala Ser Glu Arg lie Asp Gin Val Tyr Leu TVla Ala Ala 
50 55 60 

aaa gtg ggc ggc att gtt gee aac aac acc tat ccg gcg gat etc ate 240 
Lys Val Gly Gly lie Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie 
65 70 75 80 

tac cag aac atg atg att gag age aac ate att cac gcc gcg cat cag 280 



5 
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Tyr Gin Asn Met Met lie Glu Ser Asn He He His Ala Ala His Gin 
85 90 95 

aac gac gtg aac aaa ctg ctg ttt etc gga teg tec tgc ate tac ecg 336 
Asn Asp Val Asn L»ys Leu Leu Phe Leu Gly Ser Ser Cys He Tyr Pro 
100 105 110 

aaa ctg gca aaa cag ccg atg gca gaa age gag ttg ttg cag ggc acg 384 
Lys Leu Ala Lys Gin Pro Met Ala Glu Ser Glu Leu Leu Gin Gly Thr 
115 120 125 

ctg gag ccg act aac gag cct tat get att gee aaa ate gcc ggg ate 432 
Leu Glu Pro Thr Asn Glu Pro Tyr Ala He Ala Lys He Ala Gly He 
130 135 140 

aaa ctg tgc gaa tea tac aac cgc cag tac gga cgc gat tac cgc tea 480 
Lys Leu Cys Glu Ser Tyr Asn Arg Gin Tyr Gly Arg Asp Tyr Arg Ser 
145 150 155 160 

gtc atg ccg acc aac ctg tac ggg cea cac gac aac ttc cac ccg agt 528 
Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser 
165 170 175 

aat teg cat gtg ate eea gca ttg ctg egt cgc ttc cac gag geg acg 576 
Asn Ser His Val He Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr 
180 185 190 

gca cag aat gcg ccg gac gtg gtg gta tgg ggc age ggt aca ccg atg 624 
Ala Gin Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met 
195 200 205 

cgc gaa ttt ctg cac gtc gat gat atg gcg geg gcg age att eat gtc 672 
Arg Glu Phe Leu His Val Asp Asp Met Ala Ala TVla Ser He His Val 
210 215 220 

atg gag ctg gcg cat gaa gtc tgg ctg gag aac acc cag ccg atg ttg 720 
Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gin Pro Met Leu 
225 230 235 240 

teg cac att aac gtc ggc acg ggc gtt gac tgc act ate cgc gac gtg 768 
Ser His He Asn Val Gly Thr Gly Val Asp Cys Thr He Arg Asp Val 
245 250 255 

gcg caa acc ate gcc aaa gtg gtg ggt tac aaa ggc egg gtg gtt ttt 816 
Ala Gin Thr He Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe 
260 265 270 

gat gcc age aaa ccg gat ggc acg ccg cgc aaa ctg ctg gat gtg acg 864 
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Asp Ala Ser hys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp Val Thr 
275 280 285 

cgc ctg cat cag ctt ggc tgg tat cac gaa ate tea ctg gaa gcg ggg 912 

Arg Leu His Gin Leu Gly Trp Tyr His Glu lie Ser Leu Glu Ala Gly 

290 295 300 



960 



ctt gcc age act tac cag tgg tte ctt gag aat caa gac cgc ttt egg 
Leu Ala Ser Thr Tyr Gin Trp Phe Leu Glu Asn Gin Asp Arg Phe Arg 
305 310 315 320 

ggg taa 966 
Gly 



<210> 4 

<211> 321 

<212> PRT 

<213> Escherichia coli 

<400> 4 

Met Ser Lys Gin Arg Val Phe lie Ala Gly His Arg Gly Met Val Gly 
1 5 10 15 

Ser Ala lie Arg Arg Gin Leu Glu Gin Arg Gly Asp Val Glu Leu Val 
20 25 30 

Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His 
35 40 45 

Asp Phe Phe Ala Ser Glu Arg lie Asp Gin Val Tyr Leu Ala Ala Ala 
50 55 60 

Lys Val Giy Gly lie Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe lie 
65 70 75 80 

Tyr Gin Asn Met Met lie Glu Ser Asn lie lie His Ala Ala His Gin 
85 90 95 

Asn Asp Val Asn Lys Leu Leu Phe Leu Gly Ser Ser Cys lie Tyr Pro 
100 105 110 

Lys Leu Ala Lys Gin Pro Met Ala Glu Ser Glu Leu Leu Gin Gly Thr 
115 120 125 

Leu Glu Pro Thr Asn Glu Pro Tyr Ala lie Ala Lys lie Ala Gly lie 
130 135 140 
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Lys Leu Cys Glu Ser Tyr TVsn Arg Gin Tyr Gly Arg Asp Tyr Arg Ser 
145 150 155 160 



Val Met Pro Thr 



Asn Ser His Val 
180 

Ala Gin Asn Ala 
195 

Arg Glu Phe Leu 
210 

Met Glu Leu Ala 
225 

Ser His lie Asn 



Ala Gin Thr lie 
260 

Asp Ala Ser Lys 
275 

Arg Leu His Gin 
290 

Leu Ala Ser Thr 
305 

Gly 



Asn Leu Tyr Gly 
165 

lie Pro Ala Leu 



Pro Asp Val Val 
200 

His Val Asp Asp 
215 

His Glu Val Trp 
230 

Val Gly Thr Gly 
245 

Ala Lys Val Val 



Pro Asp Gly Thr 
280 

Leu Gly Trp Tyr 
295 

Tyr Gin Trp Phe 
310 



Pro His Asp Asn 
170 

Leu Arg Arg Phe 
185 

Val Trp Gly Ser 



Met Ala Ala Ala 
220 

Leu Glu Asn Thr 
235 

Val Asp Cys Thr 
250 

Gly Tyr Lys Gly 
265 

Pro Arg Lys Leu 



His Glu lie Ser 
300 

Leu Glu Asn Gin 
315 



Phe His Pro Ser 
175 

His Glu Ala Thr 
190 

Gly Thr Pro Met 
205 

Ser He His Val 



Gin Pro Met Leu 
240 

He Arg /^p Val 
255 

Arg Val Val Phe 
270 

Leu Asp Val Thr 
285 

Leu Glu Ala Gly 



Asp Arg Phe Arg 
320 



<210> 5 

<211> 1340 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (75) . . (1040) 

<400> 5 

ctagaattca gcggccgctg aattctagct agaattcagc ggccgctgaa ttctagaacc 60 
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caggtgcaac tgac atg ggt gaa ccc cag gga tec atg egg att eta gtg 110 

Met Gly Glu Pro Gin Gly Ser Met Arg He Leu Val 

15 10 

aca ggg ggc tct ggg ctg gta ggc aaa gcc ate cag aag gtg gta gca 158 
Thr Gly Gly Ser Gly Leu Val Gly Lys Ala He Gin Lys Val Val Ala 
15 20 25 

gat gga get gga ctt cot gga gag gac tgg gtg ttt gtc tec tct aaa 206 
Asp Gly Ala Gly Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys 
30 35 40 

gac gcc gat etc acg gat aca gca cag acc cgc gee ctg ttt gag aag 254 
Asp Ala Asp Leu Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys 
45 50 55 60 

gtc caa ccc aca cac gtc ate eat ctt get gca atg gtg ggg ggc ctg 302 
Val Gin Pro Thr His Val lie His Leu Ala Ala Met Val Gly Gly Leu 
65 70 75 

ttc egg aat ate aaa tac aat ttg gac ttc tgg agg aaa aac gtg cac 350 
Phe Arg Asn He Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His 
80 85 90 

atg aac gac aac gtc ctg cac teg gcc ttt gag gtg ggg gcc cgc aag 398 
Met Asn Asp Asn Val Leu His Ser Ala Phe Glu Val Gly Ala. Arg Lys 
95 100 105 

gtg gtg tec tgc ctg tec acc tgt ate ttc cct gac aag acg acc tac 44 6 
Val Val Ser Cys Leu Ser Thr Cys He Phe Pro Asp Lys Thr Thr Tyr 
110 115 120 

ccg ata gat gag acc atg ate cac aat ggg cct ccc cac aac age aat 494 
Pro He Asp Glu Thr Met He His Asn Gly Pro Pro His Asn Ser Asn 
125 130 135 140 

ttt ggg tac teg tat gcc aag agg atg ate gac gtg cag aac agg gcc 542 
Phe Gly Tyr Ser Tyr Ala Lys Arg Met He Asp Val Gin Asn Arg Ala 
145 150 155 

tac ttc cag cag tac ggc tgc acc ttc acc get gtc ate ccc acc aac 590 
Tyr Phe Gin Gin Tyr Gly Cys Thr Phe Thr Ala Val He Pro Thr Asn 
160 165 170 

gtt ttc ggg ccc cac gac aac ttc aac ate gag gat ggc cac gtg ctg 638 
Val Phe Gly Pro His Asp Asn Phe Asn He Glu Asp Gly His Val Leu 
175 180 185 
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cct ggc etc ate eac aag gtg cac ctg gcc aag age age ggc teg gee 686 

Pro Gly Leu lie His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala 
190 195 200 

ctg acg gtg tgg ggt aca ggg aat ccg egg agg cag ttc ata tac teg 734 
Leu Thr Val Trp Gly Thr Gly Asn Pro Arg Arg Gin Phe lie Tyr Ser 
205 210 215 220 

ctg gac ctg gee cag etc ttt ate tgg gtc ctg egg gag tac aat gaa 782 
Leu Asp Leu Ala Gin Leu Phe lie Trp Val Leu Arg Glu Tyr Asn Glu 
225 230 235 

gtg gag ccc ate ate etc tec gtg ggc gag gaa gat gag gtc tec ate 830 
Val Glu Pro lie lie Leu Ser Val Gly Glu Glu Asp Glu Val Ser He 
240 245 250 

aag gag gea gcc gag gcg gtg gtg gag gee atg gac ttc cat ggg gaa 878 
Lys Glu Ala Ala Glu TVla Val Val Glu Ala Met Asp Phe His Gly Glu 
255 260 265 

gtc acc ttt gat aca ace aag teg gat ggg cag ttt aag aag aca gcc 926 
Val Thr Phe Asp Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys Thr Ala 
270 275 280 

agt aac age aag ctg agg ace tac ctg ccc gac ttc egg ttc aca ccc 974 
Ser Asn Ser Lys Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro 
285 290 295 300 

ttc aag cag gcg gtg aag gag acc tgt get tgg ttc act gac aac tac 1022 
Phe Lys Gin Ala Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr 
305 310 315 

gag cag gcc egg aag tga agctggaaga caggatcagg tgccagcgga 1070 
Glu Gin Ala Arg Lys 
320 

ccatcggctg gcagagccca gcggccacca cccgtcaacc ctgccaggag ctgagggcac 1130 

cacccagcaa cctgggcctg eattccatcc gctetgcagc cccaagcatc tttceagtgg 1190 

ggcccccatt cacgttggtc ctcagggaaa ecagggtceg gggeaggeec ggcgetttgc 1250 

tccccacacc agecccctgc gcgtgtccac tctgatcctg cateccaetc cctgggagcc 1310 

aataaagtgc attttcacag aaaaaaaaaa 1340 

<210> 6 
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<211> 321 
<212> PRT 
<213> Homo sapiens 

<400> 6 

Met Gly Glu Pro Gin Gly Ser Met Arg He Leu Vai Thr Gly Giy Ser 
15 10 15 

Gly Leu Val Gly Lys Ala He Gin Lys Val Val Ala Asp Gly Ala Gly 
20 25 30 

Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
35 40 45 

Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
50 55 60 

His Val He His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn He 
65 70 75 80 

Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 
85 90 95 

Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 
100 105 110 

Leu Ser Thr Cys He Phe Pro Asp Lys Thr Thr Tyr Pro He Asp Glu 
115 120 125 

Thr Met He His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
130 135 140 

Tyr Ala Lys Arg Met He Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin 
145 150 155 160 

Tyr Gly Cys Thr Phe Thr Ala Val He Pro Thr Asn Val Phe Gly Pro 
165 170 175 

His Asp Asn Phe Asn He Glu Asp Gly His Val Leu Pro Gly Leu He 
180 185 190 

His Lys Vai His Leu Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp 
195 200 205 

Gly Thr Gly Asn Pro Arg Arg Gin Phe He Tyr Ser Leu Asp Leu Ala 
210 215 220 

Gin Leu Phe He Trp Val Leu Arg Glu Tyr Asn Glu Val Glu Pro He 
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225 230 235 240 

lie Leu Ser Val Gly Glu Glu Asp Glu Val Ser lie Lys Glu Ala Ala 
245 250 255 

Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu Val Thr Phe Asp 
260 265 270 

Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys Thr Ala Ser Asn Ser Lys 
275 280 285 

Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro Phe Lys Gin Ala 
290 295 300 

Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr Glu Gin Ala Arg 
305 310 315 320 

Lys 



<210> 7 

<211> 1017 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) , . (1017) 

<400> 7 

atg aga gtt ctg gtt acc ggt ggt age ggt tac att gga agt cat acc 48 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr lie Gly Ser His Thr 
15 10 15 

tgt gtg caa tta ctg caa aac ggt cat gat gtc ate att ctt gat aac 96 
Cys Val Gin Leu Leu Gin Asn Gly His Asp Val lie lie Leu Asp Asn 
20 25 30 

etc tgt aac agt aag cgc age gta ctg cct gtt ate gag cgt tta ggc 144 
Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val lie Glu Arg Leu Gly 
35 40 45 

ggc aaa cat cca acg ttt gtt gaa ggc gat att cgt aac gaa gcg ttg 192 
Gly Lys His Pro Thr Phe Val Glu Gly Asp lie Arg Asn Glu Ala Leu 
50 55 60 

atg acc gag ate ctg cac gat cac get ate gac acc gtg ate cae ttc 240 
Met Thr Glu lie Leu His Asp His Ala lie Asp Thr Val lie His Phe 
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65 70 75 80 

gcc ggg ctg aaa gcc gtg ggc gaa teg gta caa aaa ccg ctg gaa tat 288 
Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gin Lys Pro Leu Glu Tyr 
85 90 95 

tac gac aac aat gtc aac ggc act ctg cgc ctg att age gcc atg cgc 336 
Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu He Ser Ala Met Arg 
100 105 110 

gcc get aac gtc aaa aac ttt att ttt age tec tec gcc aec gtt tat 384 
Ala Ala Asn Val Lys Asn Phe He Phe Ser Ser Ser Ala Thr Val Tyr 
115 120 125 

ggc gat cag ccc aaa att cca tac gtt gaa age ttc ccg aec ggc aca 432 
Gly Asp Gin Pro Lys He Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 
130 135 140 

ccg caa age cct tac ggc aaa age aag ctg atg gtg gaa cag ate etc 480 
Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gin He Leu 
145 150 155 160 

acc gat ctg caa aaa gcc cag ccg gac tgg age att gcc ctg ctg cgc 528 
Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 
165 170 175 

tac ttc aac ccg gtt ggc gcg cat ccg teg ggc gat atg ggc gaa gat 576 
Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 
180 185 190 

ccg caa ggc att ccg aat aac ctg atg cca tac ate gee cag gtt get 624 
Pro Gin Gly He Pro Asn Asn Leu Met Pro Tyr He Ala Gin Val Ala 
195 200 205 

gta ggc cgt cgc gac teg ctg gcg att ttt ggt aac gat tat ccg acc 672 
Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 

gaa gat ggt act ggc gta cgc gat tac ate cac gta atg gat ctg gcg 720 
Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 

gac ggt cac gtc gtg gcg atg gaa aaa ctg gcg aac aag cca ggc gta 768 
Asp Gly His Val Val Ala Met Glu Lys Leu TVla Asn Lys Pro Gly Val 
245 250 255 

cac ate tac aac etc ggc get ggc gta ggc aac age gtg ctg gac gtg 816 
His He Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 
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260 



265 



PCT/US99/11576 

270 



gtt aat gcc ttc age aaa 
Val Asn Ala Phe Ser Lys 
275 

gca ccg cgt cgc gag ggc 
Ala Pro Arg Arg Glu Gly 
290 

aaa gcc gac cgt gaa ctg 
Lys Ala Asp Arg Glu Leu 
305 310 

atg gcg cag gac acc tgg 
Met Pila Gin Asp Thr Trp 
325 

COG gat taa 
Pro Asp 



gcc tgc ggc aaa ccg gtt 
7\la Cys Gly Lys Pro Val 
260 

gac ctt ccg gcc tac tgg 
Asp Leu Pro Ala Tyr Trp 
295 300 

aac tgg cgc gta acg cgc 
Asn Trp Arg Val Thr Arg 
315 

cac tgg cag tea cgc cat 
His Trp Gin Ser Arg His 
330 



aat tat cat ttt 864 

Asn Tyr His Phe 

285 

gcg gac gcc age 912 
Ala Asp Ala Ser 



aca etc gat gaa 960 
Thr Leu Asp Glu 
32 0 

cca cag gga tat 1008 
Pro Gin Gly Tyr 
335 

1017 



<210> 8 
<211> 338 
<212> PRT 

<213> Escherichia coli 



<400> 8 

Met Arg Val Leu 
1 

Cys Val Gin Leu 
20 

Leu Cys Asn Ser 
35 



Val Thr Gly Gly 
5 

Leu Gin Asn Gly 



Lys Arg Ser Val 
40 



Ser Gly Tyr lie 
10 

His Asp Val lie 
25 

Leu Pro Val lie 



Gly Ser His Thr 
15 

He Leu Asp Asn 
30 

Glu Arg Leu Gly 
45 



Gly Lys His Pro Thr 
50 

Met Thr Glu lie Leu 
65 

Ala Gly Leu Lys Ala 
85 

Tyr Asp Asn Asn Val 
100 



Phe Val Glu Gly Asp He 
55 

His Asp His Ala He Asp 
70 75 

Val Gly Glu Ser Val Gin 
90 

Asn Gly Thr Leu Arg Leu 
105 



Arg Asn Glu Ala Leu 
60 

Thr Val He His Phe 
80 

Lys Pro Leu Glu Tyr 
95 

He Ser Ala Met Arg 
110 
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Ala Ala Asn Val hys Asn Phe lie Phe Ser Ser Ser Ala Thr Val Tyr 
115 120 125 

Gly Asp Gin Pro hys He Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr 
130 135 140 

Pro Gin Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gin He Leu 
145 150 155 160 

Thr Asp Leu Gin Lys Ala Gin Pro Asp Trp Ser He Ala Leu Leu Arg 
165 170 175 

Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp 
180 185 190 

Pro Gin Gly lie Pro TVsn Asn Leu Met Pro Tyr He Ala Gin Val Ala 
195 200 205 

Val Gly Arg Arg Asp Ser Leu Ala He Phe Gly Asn Asp Tyr Pro Thr 
210 215 220 

Glu Asp Gly Thr Gly Val Arg Asp Tyr He His Val Met Asp Leu Ala 
225 230 235 240 

Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val 
245 250 255 

His He Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val 
260 265 270 

Val Asn Ala Phe Ser Lys Ala Cys Gly Lys Pro Val Asn Tyr His Phe 
275 280 285 

Ala Pro Arg Arg Glu Gly Asp Leu Pro Ala Tyr Trp Ala Asp Ala Ser 
290 295 300 

Lys Ala Asp Arg Glu Leu Asn Trp Arg Val Thr Arg Thr Leu Asp Glu 
305 310 315 320 

Met Ala Gin Asp Thr Trp His Trp Gin Ser Arg His Pro Gin Gly Tyr 
325 330 335 

Pro Asp 



<210> 9 
<211> 1047 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1047) 

<400> 9 

atg gca gag aag gtg ctg gta aca ggt ggg get ggc tac att ggc age 48 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
1 5 10 . 15 

cac acg gtg ctg gag ctg ctg gag get ggc tac ttg cct gtg gtc ate 96 
His Thr Val Leu Glu Leu Leu Glu JVla Gly Tyr Leu Pro Val Val lie 
20 25 30 

gat aac ttc cat aat gee ttc cgt gga ggg ggc tec ctg cct gag age 144 
Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser 
35 40 45 

ctg egg egg gtc cag gag ctg aca ggc cgc tct gtg gag ttt gag gag 192 
Leu Arg Arg Val Gin Glu Leu Thr Gly Arg ser Val Glu Phe Glu Glu 
50 55 60 

atg gac att ttg gac cag gga gcc eta cag cgt etc ttc aaa aag tac 240 
Met Asp lie Leu Asp Gin Gly Ala Leu Gin Arg Leu Phe Lys Lys Tyr 
65 70 75 80 

age ttt atg gcg gtc ate cac ttt gcg ggg etc aag gcc gtg ggc gag 288 
Ser Phe Met Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu 
65 90 95 

teg gtg cag aag cct ctg gat tat tac aga gtt aac ctg acc ggg acc 336 
Ser Val Gin Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr 
100 105 110 

ate cag ctt ctg gag ate atg aag gcc cac ggg gtg aag aac ctg gtg 384 
lie Gin Leu Leu Glu lie Met Lys Ala His Gly Val Lys Asn Leu Val 
115 120 125 

ttc age age tea gee act gtg tac ggg aac ccc cag tac ctg cce ctt 432 
Phe Ser Ser Scr Ala Thr Val Tyr Gly Asn Pro Gin Tyr Leu Pro Leu 
130 135 140 

gat gag gee cac ccc acg ggt ggt tgt acc aac cct tac ggc aag tee 480 
Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser 
145 150 155 160 
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aag ttc ttc ate gag gaa atg ate egg gac etg tgc eag gca gac aag 528 
Lys Pho Pho He Glu Glu Met He Arg Pisp Leu Cys Gin Ala Asp Lys 
165 170 175 



aet tgg aac gta gtg ctg etg cgc tat ttc aac ccc aca ggt gcc cat 576 

Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His 
180 185 190 

gcc tct ggc tgc att ggt gag gat ccc cag ggc ata ccc aac aac etc 624 

Ala Scr Gly Cys He Gly Glu Asp Pro Gin Gly He Pro Asn Asn Leu 
195 200 205 



atg cct tat gtc tec cag gtg gcg ate ggg cga egg gag gcc ctg aat 672 
Met Pro Tyr Val Ser Gin Val Ala He Gly Arg Arg Glu Ala Leu Asn 
210 215 220 



gtc ttt ggc aat gac tat gac aca gag gat ggc aca ggt gtc egg gat 720 
Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 

tac ate cat gtc gtg gat etg gcc aag ggc cac att gca gcc tta agg 768 
Tyr He His Val Val Asp Leu Ala Lys Gly His He Ala Ala Leu Arg 
245 250 255 



aag ctg aaa gaa eag tgt ggc tgc egg ate tac aac etg ggc acg ggc 816 
Lys Leu Lys Glu Gin Cys Gly Cys Arg He Tyr Asn Leu Gly Thr Gly 
260 265 270 



aca ggc tat tea gtg ctg cag atg gtc cag get atg gag aag gcc tct 864 
Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala Ser 
275 280 285 



ggg aag aag ate ccg tac aag gtg gtg gca egg egg gaa ggt gat gtg 912 
Gly Lys Lys He Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 



gca gcc tgt tac gcc aac ccc age ctg gee eaa gag gag etg ggg tgg 960 
TVla Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 



aca gca gcc tta ggg ctg gac agg atg 
Thr Ala Ala Leu Gly Leu Asp Arg Met 
325 



tgt gag gat etc tgg cgc tgg 1008 
Cys Glu Asp Leu Trp Arg Trp 
330 335 



cag aag cag aat cct tea ggc ttt ggc acg caa gcc tga 
Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 
340 345 
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<210> 10 
<211> 348 
<212> PRT 

<213> Horao sapiens 
<400> 10 

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly Ser 
15 10 15 

His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val lie 
20 25 30 

Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu ser 
35 40 45 

Leu Arg Arg Val Gin Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu 
50 55 60 

Met Asp lie Leu Asp Gin Gly Ala Leu Gin Arg Leu Phe Lys Lys Tyr 
65 70 75 80 

Ser Phe Met Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu 
85 90 95 

Ser Val Gin Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr 
100 105 110 

lie Gin Leu Leu Glu lie Met Lys Ala His Gly Val Lys Asn Leu Val 
115 120 125 

Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gin Tyr Leu Pro Leu 
130 135 140 

Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser 
145 150 155 160 

Lys Phe Phe lie Glu Glu Met lie Arg Asp Leu Cys Gin Ala Asp Lys 
165 170 175 

Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His 
180 185 190 

Ala Ser Gly Cys lie Gly Glu Asp Pro Gin Gly lie Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr Val Ser Gin Val Ala lie Gly Arg Arg Glu Ala Leu Asn 
210 215 220 



18 



wo 99/64618 PCT/US99/11576 

Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 

Tyr He His Val Val Asp Leu Ala Lys Gly His He Ala Ala Leu 7\xg 
245 250 255 

Lys Leu Lys Glu Gin Cys Gly Cys Arg He Tyr Asn Leu Gly Thr Gly 
260 265 270 

Thr Gly Tyr Ser Val Leu Gin Met Val Gin Ala Met Glu Lys Ala ser 
275 280 285 

Gly Lys Lys He Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val 
290 295 300 

Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gin Glu Glu Leu Gly Trp 
305 310 315 320 

Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp 
325 330 335 

Gin Lys Gin Asn Pro Ser Gly Phe Gly Thr Gin Ala 
340 345 



<210> 11 
<211> 317 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : CONSENSUS 
<400> 11 

Xaa Xaa Arg Xaa Xaa Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Ala Xaa Xaa Xaa 
50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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65 70 75 80 

Asm Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
100 105 110 

Xaa Xaa Xaa Pro Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 

Xaa Xaa Asn Xaa Xaa Gly Xaa His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
165 170 175 

Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
180 185 190 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Gly Xaa Xaa Xaa Arg Xaa 
195 200 205 

Xaa Xaa Xaa Xaa Xaa Asp Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
210 215 220 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
225 230 235 240 

Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa 
260 265 270 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
275 280 285 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Thr Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 
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<210> 12 
<211> 34 
<212> DNA 

<213> Escherichia coli 
<400> 12 

tagaattcag taaacaacga gtttttattg ctgg 34 

<210> 13 
<211> 32 
<212> DNA 

<213> Escherichia coli 



<210> 14 
<211> 30 
<212> DNA 

<213> Escherichia coli 
<400> 14 

ctggagtcga attcatgagt aaacaacgag 30 

<210> 15 
<211> 33 
<212> DNA 

<213> Escherichia coli 



<400> 13 



aactcgagt-t acccccaaag cggtcttgat tc 



32 



<400> 15 



aactgcagtt acccccgaaa gcggtcttga ttc 



33 
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